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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re the Application of : ^ 

Caroline Dean, et al . : 

Serial No. 09/890,220 : 

Filed July 27, 2001 : 

For: METHODS AND MEANS FOR : 

MODIFICATION OF PLANT : 

CHARACTERISTICS USING THE : 
VERNALIZATION GENE VRN2 

PRELIMINARY AMENDMENT 

Please amend the above-identified patent application as 
follows : 

In the specification: 

At page 88 of the application, on a separate sheet, please add 
the following abstract: 

--An isolated nucleic acid obtainable from the VRN2 locus 
of a plant, which nucleic acid encodes a polypeptide which is 
capable of affecting one or more physical characteristics of a 
plant into which the nucleic acid is introduced, the physical 
characteristics being selected from vernalization response, 
flowering time, leaf size, and/or shape or shade avoidance 
response; alleles, fragment and derivatives thereof; 
polypeptides encoded by such nucleic acids; antibodies to such 
peptides . 

In the claims: 

Please add the following new claims: 

60. An isolated nucleic acid which comprises a variant 
sequence which is a homologous variant of a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1, 
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SEQ ID: NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ ID NO : 7 
and which shares at least 50% identity therewith, said variant 
being capable of affecting one or more physical 
characteristics of a plant into which the nucleic acid is 
introduced, the physical characteristics being selected from 
vernalization response, flowering time, leaf size and/or shape 
or shade avoidance response, said isolated nucleic acid 
optionally comprising a sequence having promoter and/or 
regulatory function. 

61. A nucleic acid according to claim 60 wherein the variant 
is an allelic variant of a nucleotide sequence selected from 
the group consisting of SEQ ID NO : 1, SEQ ID: NO: 3, SEQ ID 
NO: 4, SEQ ID NO : 6, and SEQ ID NO : 7 or is a nucleic acid 
degenerative equivalent thereof . 

62. A nucleic acid according to claim 60 wherein the variant 
sequence is a VRN2 sequence obtainable from a plant species 
other than Arabidopsis thaliana. 

63. An isolated nucleic acid sequence according to claim 60, 
which is a derivative of said sequence by way of one or more 
of addition, insertion, deletion or substitution of a said 
nucleotide sequence, and wherein said derivative shares at 
least 50% homology with a nucleotide sequence selected from 
the group consisting of SEQ ID NO: 1, SEQ ID: NO: 3, SEQ ID 
NO: 4, SEQ ID NO : 6, and SEQ ID NO : 7 and wherein the 
derivative sequence is either capable of affecting one or more 
physical characteristics of a plant into which the nucleic 
acid is introduced, the physical characteristics being 
selected from vernalization response, flowering time, leaf 
size and/or shape or shade avoidance response or has promoter 
and/or regulatory function. 

64. An isolated nucleic acid which comprises a sequence 
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selected from the group consisting of SEQ ID NO: 1, SEQ ID: 
NO: 3, SEQ ID NO: 4, SEQ ID NO : 6, and SEQ ID NO : 7 which is a 
fragment of a sequence and either is capable of affecting one 
or more physical characteristics of a plant into which the 
nucleic acid is introduced, the physical characteristics being 
selected from vernalization response, flowering time, leaf 
size and/or shape or shade avoidance response or has promoter 
and/ or regulatory function. 

65. An isolated nucleic acid which comprises a sequence which 
is the complement of a sequence of claim 60. 

66. An isolated nucleic acid for use as a probe or primer 
which comprises a sequence that encodes an amino acid sequence 
that is partly, substantially or completely conserved between 
a VRN2 sequence of SEQ ID Nos . 2, 5 or 8 and at least one of 
the other sequences shown in Figure 8a or 8b, wherein said 
nucleic acid is 15 to 40 nucleotides in length. 

67. An isolated nucleic acid for use as a probe or primer 
which comprises a sequence that is partly, substantially or 
completely conserved between two or more of the VRN2 
nucleotide sequences selected from the group consisting of SEQ 
ID NO: 1, SEQ ID: NO: 3, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ 
ID NO: 7 or the complements thereof, wherein said nucleic acid 
is 15 to 40 nucleotides in length said nucleic acid optionally 
comprising a sequence having promoter and or regulatory 
function. 

68. A nucleic acid according to claim 67 which is 19 to 28 
nucleotides in length. 

69. A pair of primers each comprising a nucleic acid 
according to claim 67 . 
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70. A pair of primers according to claim 69 which is selected 
from: 

VRN2-AP and VRN2-AJ; 
VRN2-A0 and VRN2-AS; and 
VRN2-AI and VRN2-AJ. 

71. A process for producing a nucleic acid which is a 
derivative according to claim 63 which process comprises the 
step of modifying a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID: NO: 3, SEQ ID NO : 4, 
SEQ ID NO: 6, and SEQ ID NO : 7. 

72. A method for identifying or cloning a nucleic acid 
according to claim 60 in a database search, which method 
comprises using a nucleotide sequence of a probe or primer 
comprising a sequence that is partly, substantially or 
completely conserved between two or more of the VRN2 
nucleotide sequences selected from the group consisting of SEQ 
ID NO: 1, SEQ ID: NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ 
ID NO: 7 or the complements thereof, wherein said probe or 
primer is 15 to 40 nucleotides in length. 

73. A method for identifying or cloning a nucleic acid 
selected from the group consisting of SEQ ID NO: 1, SEQ ID: 
NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ ID NO : 7 , which 
method employs a probe or primer, said probe or primer 
comprising a sequence that encodes an amino acid sequence that 
is partly, substantially or completely conserved between a 
VRN2 sequence of SEQ ID Nos. 2, 5 or 8 and at least one of the 
other sequences shown in Figure 8a or 8b, wherein said nucleic 
acid is 15 to 40 nucleotides in length or a pair of primers 
selected from the group consisting of VRN2-AP and VRN2-AJ; 
VRN2-AO and VRN2-AS; and 

VRN2-AI and VRN2-AJ. 
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74. A method for determining the presence of a nucleic acid 
according claim 60 within the genetic make-up of a plant, 
which method employs a probe or primer, said probe or primer 
comprising a sequence that encodes an amino acid sequence that 
is partly, substantially or completely conserved between a 
VRN2 sequence of SEQ ID Nos . 2, 5 or 8 and at least one of the 
other sequences shown in Figure 8a or 8b, wherein said nucleic 
acid is 15 to 40 nucleotides in length, or a pair of primers 
selected from the group consisting of VRN2-AP and VRN2-AJ; 
VRN2-AO and VRN2-AS; and 

VRN2-AI and VRN2-AJ. 

75. A method according to claim 73, which comprises the steps 
of: 

(a) providing a preparation of nucleic acid from a plant 

cell ; 

(b) providing a nucleic acid molecule which is a probe, 
said probe comprising a sequence that encodes an amino acid 
sequence that is partly, substantially or completely conserved 
between a VRN2 sequence of SEQ ID Nos. 2, 5 or 8 and at least 
one of the other sequences shown in Figure 8a or 8b, wherein 
said probe is 15 to 40 nucleotides in length; 

(c) contacting nucleic acid in said preparation with 
said probe under conditions for hybridisation; and 

(d) identifying a nucleic acid selected from the group 
consisting of SEQ ID NO : 1, SEQ ID: NO: 3, SEQ ID NO : 4, SEQ 
ID NO: 6, and SEQ ID NO : 7, if present by its hybridisation 
with said nucleic acid probe. 

76. A method according to claim 73, which method comprises 
the steps of: 

(a) providing a preparation of nucleic acid from a plant 

cell ; 

(b) providing a pair of nucleic acid molecule primers 
suitable for PCR, at least one of said primers being a primer 
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which comprises a sequence that encodes an amino acid sequence 
that is partly, substantially or completely conserved between 
a VRN2 sequence of SEQ ID Nos . 2, 5 or 8 and at least one of 
the other sequences shown in Figure 8a or 8b, wherein said 
nucleic acid is 15 to 40 nucleotides in length; 

(c) contacting nucleic acid in said preparation with 
said primers under conditions for performance of PCR; 

(d) performing PCR and determining the presence or 
absence of an amplified PCR product. 

77. A method according to claim 76 wherein the pair of 
nucleic acid molecule primers are VRN2-AP and VRN2-AJ; 
VRN2-AO and VRN2 - AS ; and 

VRN2-AI and VRN2-AJ. 

78. A method of selecting a plant having a desired allele of 
the VRN2 gene, which method employs a probe or primer, said 
probe or primer comprising a sequence that encodes an amino 
acid sequence that is partly, substantially or completely 
conserved between a VRN2 sequence of SEQ ID Nos. 2, 5 or 8 and 
at least one of the other sequences shown in Figure 8a or 8b, 
wherein said nucleic acid is 15 to 40 nucleotides in length, 
or a pair of primers selected from the group consisting of 
VRN2-AP and VRN2-AJ; 

VRN2-A0 and VRN2-AS; and 
VRN2-AI and VRN2-AJ. 

79. A recombinant vector which comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1, 
SEQ ID: NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ ID NO : 7 or 
a sequence degeneratively equivalent thereto. 

80. A vector according to claim 79 wherein the nucleic acid 
comprised in the vector is capable of regulating one or more 
genes involved in the transition from vegetative to 
reproductive growth and/or capable of regulating one or more 
genes involved in the determination of leaf size and/or shape. 
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81. A vector according to claim 80 wherein the nucleic acid 
is operably linked to a promoter for transcription in a host 
cell, wherein the promoter is optionally an inducible 
promoter . 

82. A vector according to claim 81 wherein the promoter is a 
constitutive promoter. 

83. A vector according to any one of claims 79 which is a 
plant vector. 

84. A method which comprises the step of introducing a vector 
according to claim 79 into a host cell such as to transform 
the host cell. 

85. A host cell which is transformed with a heterologous 
nucleic acid selected from the group consisting of SEQ ID NO: 
1, SEQ ID: NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ ID NO : 7 
and sequences degeneratively equivalent thereto. 

86. A host cell according to claim 85 which is a plant cell, 
optionally present in a plant. 

87. A method for producing a transgenic plant, which method 
comprises the steps of: 

(a) performing a method according to claim 84; and 

(b) regenerating a plant from the transformed host cell. 

88. A transgenic plant which is obtainable by the method of 
claim 87, or which is a clone, or selfed or hybrid progeny or 
other descendant of said transgenic plant, which in each case 
includes the plant cell transformed with a heterologous 
nucleic acid selected from the group consisting of SEQ ID NO: 
1, SEQ ID: NO: 3, SEQ ID NO : 4, SEQ ID NO : 6, and SEQ ID NO : 7 
and sequences degeneratively equivalent thereto . 
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89. A plant according to claim 88 which is an agricultural or 
horticultural plant. 



90. A plant according to claim 89 selected from the list 
consisting of: rice, maize, wheat, barley, oats, rye, oil seed 
rape, sugar beet, sunflower, soybean, sorghum, lettuce, 
endive, cabbage, broccoli, cauliflower, carnation, geranium, 
tobacco, cotton, canola, tomato, mango, peach, apple, pear, 
strawberry, banana, melon, carrot, onion, pea, celery. 

91. A plant according to claim 90 which is selected from 
tobacco, oil seed rape, rice and wheat. 

92. A part or propagule from a plant which is a clone, or 
selfed or hybrid progeny or other descendant of said 
transgenic plant, which in each case includes the plant cell 
transformed with a heterologous nucleic acid selected from the 
group consisting of SEQ ID NO: 1, SEQ ID: NO: 3, SEQ ID NO : 4, 
SEQ ID NO: 6, and SEQ ID NO : 7 and sequences degeneratively 
equivalent thereto . 

93. An isolated polypeptide which is encoded by a nucleotide 
sequence according to claim 1. 

94. A polypeptide according to claim 93 which comprises an 
amino acid sequence which consists of the sequence of SEQ ID 
Nos. 2, 5 or 8. 

95. A polypeptide which is a fragment of a polypeptide of 
claim 94, having at least nine contiguous amino acids. 

96. An isolated polypeptide which consists of the sequence of 
SEQ ID Nos . 2 , 5 or 8 . 

97. An isolated polypeptide which is a fragment of a 
polypeptide according to claim 96 and which comprises at least 
9 contiguous amino acids of that polypeptide. 
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98. A fragment according to claim 97 which comprises amino 
acids 90 to 111 (zinc finger motif), amino acids 63 to 132 
(amino terminal region) , amino acids 263 to 366 (carboxy 
terminus) or amino acids 2 63 to 32 8 (activation domain) . 

99. A polypeptide according to claim 93 which is capable of 
affecting one or more physical characteristics of a plant 
expressing said peptide, the physical characteristics being 
selected from vernalization response, flowering time, leaf 
size and/or shape or shade avoidance response. 

100. An isolated nucleic acid which encodes a fragment 
according to claim 97. 

101 . An isolated nucleic acid which is the complement of the 
nucleic acid of claim 100. 

102. A method of making the polypeptide of a sequence selected 
from the group consisting of SEQ ID NO: 2, 5, or 8, which 
method comprises the step of causing or allowing expression 
from a nucleic acid encoding SEQ ID NOS 2, 5, or 8 . 

103. An antibody which has specific binding affinity for a 
polypeptide according to claim 93. 

104. A polypeptide which comprises the antigen-binding site of 
the antibody of claim 103. 

105. A method for affecting a physical characteristic of a 
plant selected from vernalization response, flowering time, 
leaf size and/ or shape or shade avoidance response, which 
method comprises either the step of: 

(i) causing or allowing transcription from a nucleic 
acid which comprises a sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID: NO: 3, SEQ ID NO : 4, SEQ 
ID NO: 6, and SEQ ID NO : 7 or a fragment of said sequence 
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which is capable of affecting one or more physical 
characteristics of a plant into which the nucleic acid is 
introduced, the physical characteristics being selected from 
vernalization response, flowering time, leaf size and/or shape 
or shade avoidance response or has promoter and/ or regulatory 
function . 

106. A method according to claim 105 which method comprises 
causing or allowing transcription from said nucleic acid 
thereby reducing VRN2 expression by co- suppression . 



The purpose of this preliminary amendment is insert an 
abstract of the disclosure into the specification and to 
reinstate claims canceled in the first preliminary amendment. 

Favorable consideration leading to prompt allowance of 
the present application is respectfully requested. 



REMARKS 



Respectfully submitted, 
DANN, DORFMAN, HERRELL AND SKILLMAN 
A Professorial Corporation 




Telephone: (215) 563-4100 
Facsimile: (215) 563-4044 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re the Application of 

Caroline Dean, et al . : 

Serial No. Not yet assigned 
(PCT/GB00/00248) : 
Int. App. filed 01/28/00 : 

For: METHODS AND MEANS FOR : 

MODIFICATION OF PLANT : 

CHARACTERISTICS USING THE : 

VERNALIZATION GENE VRN2 : 

PRELIMINARY AMENDMENT 

Before calculation of the filing fee, please amend the 
above-identified patent application as follows: 

In the specification: 

At page 1, line 3, please insert the following priority claim: 
--The present application is the U.S. National Phase 
of PCT/GB00/00248 filed January 28, 2000 which claims priority 
to GB 9901927.5 filed January 28, 1999 under 35 U.S.C. 
§119 (e) , the entire disclosure of which is incorporated by 
reference herein . -- 

In the claims: 

3. (Amended) A nucleic acid according to claim 1 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 2. 

5. (Amended) A nucleic acid according to claim 1 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 5. 

7. (Amended) A nucleic acid according to claim 1 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 8. 



Please cancel claims 11-59. 



REMARKS 



The purpose of this preliminary amendment is insert a 
priority claim into the application and to remove existing 
multiple dependencies from the claims. Claims 11-59 have been 
canceled. Applicants intend to submit claims directed to the 
subject matter of canceled claims 11-59 via preliminary 
amendment in the very near future . 

Favorable consideration leading to prompt allowance of 
the present application is respectfully requested. 



Respectfully submitted, 
DANN, DORFMAN, HERRELL AND SKILLMAN 
A Professional Corporation 




Telephone: (215) 563-4100 
Facsimile; (215) 563-4044 



Kathleen D. Rigaut, Ph.gJ. , J.D 
PTO Registration No. 43,047 



Enclosures: Marked up draft of amended claims. 
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Marked up draft of claims showing where amendments have been 
made . 

3. (Amended) A nucleic acid according to claim 1 [or claim 
2] which comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 2 . 

5. (Amended) A nucleic acid according to claim 1 [or claim 
2] which comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 5 . 

7. (Amended) A nucleic acid according to claim 1 [or claim 
2] which comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No. 8. 
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METHODS AND MEANS FOR MODIFICATION OF PLANT CHARACTERISTICS 
USING THE VERNALIZATION GENE VRN2 



The present invention relates generally to modification of 
5 characteristics of plants, especially vernalization response, 
flowering time, leaf shape and/or shade avoidance response, 
and has arisen on the basis of cloning of the VRN2 gene and 
mutant alleles of Arabidopsis thaliana, and identification of 
homologues in other species . 

10 

In different embodiments, the present invention provides for 
manipulation of flowering time and/or other characteristics of 
plants, e.g. by up or down regulating VRN2 gene expression. 
The present invention also provides for modification of the 
&. 5 extent of alteration of a relevant plant characteristic 
through the use of gene alleles, mutants and variants. 

Plants must integrate a wide variety of environmental signals 
in order to maximize their reproductive success. Part of this 

20 integration must involve perception of the seasons, both to 
ensure the plant flowers during the correct season (for which 
it is adapted) and to synchronise its flowering with other 
members of its own species, to increase the chances of cross- 
fertilization. Arabidopsis thaliana serves as a model plant, 

25 for it exhibits responses to a wide variety of environmental 
stimuli that are observed in many species. Flowering in 
naturally occurring strains (ecotypes) of Arabidopsis can be 
promoted either by long days {increased photoperiod) or by 
vernalization, a long cold treatment that mimics the cold of 

30 winter. While many aspects of the photoperiodic response are 
now understood, the vernalization pathway has received 
relatively less attention. The inventors have used a late 
flowering, vernalization responsive mutant of Arabidopsis, the 
fca mutant, as a background in which to isolate mutants that 

35 exhibit a reduced vernalization response, the VRN mutants. 
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Vernalization is the low temperature promotion of flowering. 
It can also be thought of as the cold aspect of the winter 
Season, which also includes reduced daylight hours. Many 
species of plants that grow in temperate or cooler climes have 
an obligate requirement for vernalization in order to flower. 
Such plants typically germinate in autumn, and over-winter as 
vegetative plants, and flower in milder conditions of spring. 
Vernalization thus acts as a cue, to allow plants to sense the 
seasons, and to time their flowering to maximise their chance 
of reproductive success. 

Species for which flowering is important to crop production 
are numerous, essentially all crops which are grown from seed, 
with important examples being the cereals, rice and maize, 
probably the most agronomically important in warmer climatic 
zones, and wheat, barley, oats and rye in more temperate 
climates. Important seed products are oil seed rape, sugar 
beet, maize, sunflower, soybean and sorghum. Many crops which 
are harvested for their roots are, of course, grown annually 
from seed and the production of seed of any kind is very 
dependent upon the ability of the plant to flower, to be 
pollinated and to set seed. In horticulture, control of the 
timing of flowering is important. Horticultural plants whose 
flowering may be controlled include lettuce, endive and 
vegetable brassicas including cabbage, broccoli and 
cauliflower, and carnations and geraniums. 

Arabidopsis thaliana is a facultative long day plant, 
flowering early under long days and late under short days. 
Because it has a small, well-characterized genome, is 
relatively easily transformed and regenerated and has a rapid 
growing cycle, Arabidopsis is an ideal model plant in which to 
study flowering and its control. 

In addition to cloning the VRN2 gene, the inventors have 
unexpectedly found indication that VRN2 is a transcription 
factor, which in itself opens several exciting avenues for 
application of the present invention. Without being bound by 
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theory or in anyway limiting the scope of the present 
invention, VRN2 may be required for a normal vernalization 
response because it acts as a regulator of genes that 
ultimately lead to the transition from vegetative to 
5 reproductive growth. In such a model, cold, or a downstream 
molecule involved in cold perception, may regulate the 
activity or expression of the VRN2 protein, which in turn may 
regulate the expression of a large number of genes that 
ultimately lead to flowering. Furthermore, the shade 
10 avoidance phenotype exhibited by the vm2-l mutant, as 

demonstrated experimentally below, provides indication that 
VRN2 also plays a role in regulating leaf shape, particularly 
in response to increased far-red light. Together, these two 
processes affected by a deficiency or reduction of VRN2 
15 activity provide for a number of approaches of agronomic 
interest . 



First, forced expression of VRN2 (for example under the 
control of a strong and constitutive promoter, such as the Ca 

20 MV 35 S promoter) in a wild- type background may be used alter 
the vernalization requirement of a plant prior to flowering. 
As a large number of commercial cultivars of several species, 
including (diploid) wheat, barley, and sugar beet, have a 
requirement for vernalization to flower, modification of this 

25 requirement, by reducing the duration of vernalization 

required, or changing the optimum temperature, or abrogating 
the requirement altogether, is of agronomic usefulness. For 
instance, a winter crop that can be sown and left in the 
ground for a shorter period than usual (i.e. a reduced 

30 vernalization time) may benefit from reduced risk of damage 

associated with severe winter weather conditions, as the crops 
are exposed to winter conditions for a shorter time. 



Second, down- regulation of VRN2 expression, for instance by 
35 means of an antisense VRN2 cDNA, may be used to recapitulate 
the reduction in shade avoidance phenotype observed in vrn2-l 

mutants. This may be used in situations where crowding of the 
crop is a problem. Based on experimental evidence provided 
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herein on the phenotype of vrn2-l mutants, such plants are' 
expected to exhibit less of a response to such conditions, and 
to produce leaves that are essentially normal i.e. as if they 
had not been grown in dense or crowded conditions. The normal 
shade avoidance phenotype is a reduction in leaf size, which 
reduces shade in overcrowded conditions; vrn2-l mutants, 

defective in VRN2 production, show less reduction in leaf size 
under conditions which would normally lead to the shade 
avoidance phenotype. This effect can therefore be reproduced 
for example by using antisense VRN2 cDNA to downregulate VRN2 
expression, preventing or reducing the leaf avoidance response 
even in overcrowded conditions. 

Third, the individual isolated domains of the VRN2 protein may 
be used in their own right. DNA binding of the zinc finger of 
VRN2 may be used to direct or control gene expression in a 
precise manner. The VRN2 zinc finger may recognize specific 
DNA sequences that represent elements in the promoters of its 
normal target genes. By creating fusion proteins, comprising 
the DNA binding (zinc finger) domain of VRN2 , and an 
activation or repression domain from a heterologous protein, 
the expression of VRN2 target genes may be controlled. This 
allows for a precise control of the expression of those genes 
that are normally targets of VRN2 . Given that such genes, 
alone or in combination, ultimately control the transition to 
flowering (usually following vernalization) , their directed 
expression in other conditions may also be used to elicit 
changes in flowering and/or one or more other plant 
characteristics. The expression of (normally far-red 
responsive) target genes also be controlled using VRN2 fusion 
proteins containing the zinc finger of VRN2 . Furthermore, the 
use of the zinc finger domain of VRN2 in conventional SELEX or 
one -hybrid experiments may be used to reveal the target genes 
or DNA sequences normally bound by VRN2 . 

The acidic activation domain of VRN2 may be used to regulate 
the activity of a fusion protein, including a DNA-binding 
protein of known specificity, and the activation domain of 
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VRN2 . This allows for regulation of target genes of other DNA 
binding proteins involved in flowering, or of target genes in 
completely unrelated processes . 

5 The inventors have cloned, characterised and manipulated the 
VRN2 gene of Arabidopsis tha.lia.na, both Columbia and Landsberg 
erecta types, and identified alternatively spliced and mutant 
forms, also homologues in other species. 

10 In the light of the inventors'' experimental work, a first 
aspect of the present invention provides a nucleic acid 
isolate encoding a polypeptide including a VRN2 amino acid 
sequence shown herein (e.g. SEQ ID NO: 2; SEQ ID NO: 5), which 
may include a coding sequence shown herein (e.g. SEQ ID NO: 1; 

15 SEQ ID NO: 4) . 

Allelic forms and alternatively spliced forms of the gene have 
been identified. Such polypeptides and encoding nucleic acid 
(e.g. as in SEQ ID NO: 8, encoded by SEQ ID NO: 7) are each 
20 further provided as an aspect of the invention, as are 
polypeptides and nucleic acid including the mutations 
identified herein. 

Nucleic acid according to the present invention may have the 
25 sequence of the VRN2 gene of Arabidopsis thaliana as indicated 
in SEQ ID NO: 1, SEQ ID NO: 3 (Landsberg erecta genomic 
sequence) , SEQ ID NO: 4 or SEQ ID NO: 6 {Columbia genomic 
sequence), or be a mutant, variant, derivative or allele or a 
homologue of the sequence provided. Preferred mutants, 
30 variants, derivatives and alleles are those which encode a 
protein which retains a functional characteristic of the 
protein encoded by the wild- type gene, especially the ability 
to alter vernalization response, flowering time, leaf shape 
and/or shade avoidance response. 

35 

A mutant, variant, derivative, allele or homologue in 
accordance with the present invention may have the ability to 
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affect a physical characteristic of a plant, especially 
vernalization response, flowering time, leaf shape and/or 
: shade avoidance response, as discussed. 

5 Polynucleotides which are not 100% identical to the sequences 
shown herein but fall within the scope of the invention can be 
obtained in a number of ways . 

Other VRN2 variants (for example allelic forms) of the gene 
10 described herein may be obtained for example by probing cDNA 
or genomic DNA libraries made from Arabidopsis thaliana plants 
or cells . 

In addition, other plant, monocot or dicot, homologues of the 
{15 gene may be obtained. Such sequences may be obtained by 

making or obtaining cDNA libraries made from dividing cells or 
tissues or genomic DNA libraries from other plant species, and 
probing such libraries with probes comprising all or part of a 
nucleic acid of the invention under conditions of medium to 
;20 high stringency (for example for hybridization on a solid 

support (filter) overnight incubation at 42°C in a solution 
containing 50% formamlde, 5xSSC (750 mM NaCl, 75 mM sodium 
citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's 
solution, 10% dextran sulphate and 20 fig /ml salmon sperm DNA, 
25 followed by washing in 0.03M sodium chloride and 0.03M sodium 
citrate (i.e. 0.2x SSC) at from about 50°C to about 60°C) . 

Thus the present invention provides an isolated nucleic acid 
which hybridizes to the nucleotide sequence shown in a figure 
30 herein under the abovementioned hybridization and washing 
conditions. Such a nucleic acid is suitable for use as a 
probe for detecting the VRN2 gene, for example in Southern 
blots . 

35 Suitable probe and primer sequences are disclosed herein. 

Alternatively, polynucleotides of the invention may be 
obtained by site directed mutagenesis of the sequences of 
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shown in the figures or allelic variants thereof. This may be 
useful where for example silent codon changes are required to 
sequences to optimise codon preferences for a particular host 
cell in which the polynucleotide sequences are being 
5 expressed. Other sequence changes may be desired in order to 
introduce restriction enzyme recognition sites, or to alter 
the property or function of the polypeptides encoded by the 
polynucleotides. Further changes may be desirable to 
represent particular coding changes which are required to 
10 provide, for example, conservative substitutions. 

In the context of cloning, it may be necessary for one or more 
gene fragments to be ligated to generate a full-length coding 
sequence. Also, where a full-length encoding nucleic acid 

15 molecule has not been obtained, a smaller molecule 

representing part of the full molecule, may be used to obtain 
full-length clones. Inserts may be prepared from partial cDNA 
clones and used to screen cDNA libraries. The full-length 
clones isolated may be subcloned into expression vectors and 

20 activity assayed by transfection into suitable host cells, 
e.g. with a reporter plasmid. 

The present invention also extends to nucleic acid comprising 
transcriptional control sequences for the VEN2 gene. Such 

25 control sequences will be found 5 1 to the open reading frame 
of the gene and are obtainable by probing a genomic DNA 
library with a nucleic acid of the invention, selecting a 
clone which hybridizes under conditions of medium to high 
stringency, and sequencing the clone 5' to the open reading 

30 frame of the gene. Where only a small amount of sequence is 
present in the 5* region, this sequence may be used to reprobe 
the library to genome walk further upstream. Analysis of the 
upstream region will reveal control regions for gene 
expression including control regions common to many genes (i.e 

35 TATA and CAAT boxes) and other control regions, usually 
located from 1 to 10,000, such as 1 to 1000 or 50 to 500 
nucleotides upstream of the start of transcription. 
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To confirm that such regions are the control regions of the 
gene, they may be linked to a reported gene (such as 3- 
galactosidase) and tested in any suitable in vitro or in vivo 
system. For example the construct of the control region {e.g. 
5 comprising 50 to 500 nucleotides upstream of the start of 

transcription) and the reporter gene may be used to produce a 
transgenic plant and the pattern of expression, both spatially 
and development ally, may be compared with that of the VRN2 
gene. Where substantially similar patterns of expression are 
10 found, this shows that the construct comprises substantially 
all of the control region of the wild type gene. 

SEQ ID NO: 3 and SEQ ID NO: 6 show the nucleotide sequence of 
the VRN2 genomic region including promoter, respectively for 
15 Landsberg erecta and Columbia ecotypes of Arabidopsis 
thaliana, also 3 1 regulatory elements. 

The control region may be mutated to identify specific 
subregions responsible for transcriptional control. This may 
2 0 be achieved by a number of techniques well known in the art as 
such, including DNase protection footprint assays, in which 
the control region is brought into contact with an extract 
from a cell in which the VRN2 gene is actively expressed, and 

the regions of the control region which bind factors in that 
25 extract is determined. 

Isolated nucleic acid comprising such control regions 
obtainable by such a method form a further aspect of the 
present invention . 

30 

The present invention further extends to genomic DNA exon 
sequences found between the introns encoding a VRN2 gene in 
plant . Such exon sequences may be obtained in a manner 
analogous to that described above for the transcriptional 
35 control sequences, with the appropriate genome walking being 
conducted between the intron sequences. The locations of the 
exons may be determined by comparing genomic and cDNA 
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sequences of the gene, observing where the sequences line up 
and diverge, and looking for consensus splice sequences which 
define intron/exon boundaries. 

5 As noted above, changes to a sequence, to produce a mutant, 
variant or derivative, may be by one or more of addition, 
insertion, deletion or substitution of one or more nucleotides 
in the nucleic acid, leading to the addition, insertion, 
deletion or substitution of one or more amino acids in the 
10 encoded polypeptide. Of course, changes to the nucleic acid 
which make no difference to the encoded amino acid sequence 
^; { "degeneratively equivalent") are included. 

2;; Preferred nucleic acid sequences according to the present 

pyL5 invention are shown herein, for instance see SEQ ID NO: 1 and 

il l SEQ ID NO: 4, of which the respective predicted encoded amino 

7" acid sequences of polypeptides according to the present 

N; invention are shown in SEQ ID NO: 2 and SEQ ID NO: 5. 

|t20 A mutant, allele, variant or derivative amino acid sequence in 
accordance with the present invention may include within a 
sequence shown herein a single amino acid change with respect 
to the sequence shown with the relevant SEQ ID NO: or in the 
relevant figure, or 2, 3, 4, 5, 6, 7, 8, or 9 changes, about 

25 10, 15, 20, 30, 40 or 50 changes, or greater than about 50, 

60, 70, 80 or 90 changes. In addition to one or more changes 
within the amino acid sequence shown in the relevant figure, a 
mutant, allele, variant or derivative amino acid sequence may 
include additional amino acids at the C-terminus and/or N- 

30 terminus. 

A sequence related to a sequence specifically disclosed herein 
shares homology with that sequence. Homology may be at the 
nucleotide sequence and/or amino acid sequence level. 
35 Preferably, the nucleic acid and/or amino acid sequence shares 
homology with the coding sequence or the sequence encoded by a 
nucleotide sequence shown herein, for instance SEQ ID NO: 2 or 
SEQ ID NO: 5, preferably at least about 50%, or 60%, or 70%, 
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or 80% homology, most preferably at least about 90%, 95%, 96%, 
97%, 98% or 99% homology. 

As is well -understood, homology at the amino acid level is 
5 generally in terms of amino acid similarity or identity. 
Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoleucine, 
valine, leucine or methionine for another, or the substitution 
of one polar residue for another, such as arginine for lysine, 
10 glutamic for aspartic acid, or glutamine for asparagine. 
Similarity may be as defined and determined by the TBLASTN 
= program, of Altschul et al . (1990) J. Mol . Biol. 215: 403-10, 
£i which is in standard use in the art, or, and this may be 
% preferred, either of the standard programs Best Fit and GAP, 
q!5 which are part of the Wisconsin Package, Version 8, September 
^ 1994, (Genetics Computer Group, 575 Science Drive, Madison, 
G Wisconsin, USA, Wisconsin 53711) . BestFit makes an optimal 
s alignment of the best segment of similarity between two 

j=y sequences. Optimal alignments are found by inserting gaps to 
H*20 maximize the number of matches using the local homology 
jjr algorithm of Smith and Waterman (Advances in Applied 
M ; Mathematics (1981) 2,- pp. 482-489). GAP uses the Needleman and 
Wunsch algorithm to align two complete sequences that 
maximizes the number of matches and minimizes the number of 
25 gaps. Generally, the default parameters are used, with a gap 
creation penalty - 12 and gap extension penalty = 4. 

Homology is generally over the full-length of the relevant 
sequence shown herein, that is unless stated otherwise, or it 
30 may be over a contiguous sequence of about or greater than 
about 20, 25, 30, 33, 40, 50, 67, 133, 167, 200, 233, 267, 
300, 333, 400, 450, 500, 550, 600 or more amino acids or 
codons, compared with the relevant amino acid sequence or 
nucleotide sequence as the case may be. 

35 

In highly preferred embodiments, all percentage homologies 
referred to herein refer to percentage sequence identity. 
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In this context , percent (%) amino acid sequence identity with 
respect to a particular reference sequence is defined as the 
percentage of amino acid residues in a candidate sequence that 
are identical with the amino acid residues in the reference 
sequence, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence 
identity, and not considering any conservative substitutions 
as part of the sequence identity. 

The % identity values used herein may be determined by WU- 
BLAST-2 which was obtained from [Altschul et al . , Methods in 
Enzvmoloqv, 266 :460-480 (1996) ; http: //blast . wustl/edu/blast/ 
README .html] . WU-BLAST-2 uses several search parameters, most 
of which are set to the default values. The adjustable 
parameters are set with the following values: overlap span 
=1, overlap fraction- 0.125, word threshold (T) = 11. The 
HSPS and HSPS2 parameters are dynamic values and are 
established by the program itself depending upon the 
composition of the particular sequence and composition of the 
particular database against which the sequence of interest is 
being searched; however, the values may be adjusted to 
increase sensitivity . 

A% amino acid sequence identity value is determined by the 
number of matching identical residues divided by the total 
number of residues of the "longer" sequence in the aligned 
region. The "longer" sequence is the one having the most 
actual residues in the aligned region (gaps introduced by WU- 
Blast-2 to maximize the alignment score are ignored) . 

In a similar manner, percent (%) nucleic acid sequence 
identity with respect to a reference nucleic acid sequence is 
defined as the percentage of nucleotide residues in a 
candidate sequence that are identical with the nucleotide 
residues in the reference sequence. The identity values may 
be determined by the BLASTN module of WU-BLAST-2 set to the 



WO 00/44918 PCT/GB00/00248 

12 

default parameters, with overlap span and overlap fraction set 
to 1 and 0.125, respectively. 

Nucleic acid according to the present invention may consist 
5 essentially of or consist of the relevant coding sequence. 
Nucleic acid according to the present invention may include a 
promoter or other regulatory sequence as discussed further 
elsewhere herein, and such regulatory sequence may be 
heterologous to the coding sequence, that is to say not 

10 naturally operably linked therewith. Nucleic acid according 
to the present invention may be cDNA or lacking one or more 
introns which occur naturally, or may be«in any non-naturally 
occurring form. A coding sequence in accordance with the 
present invention may be included with a larger nucleic acid 

15 molecule of less than about 10,000 nucleotides, less than 

about 5,000 nucleotides or less than about 2,000 nucleotides. 

Also provided by an aspect of the present invention is nucleic 
acid including or consisting essentially of a sequence of 

20 nucleotides complementary to a nucleotide sequence 

hybridisable with any encoding sequence provided herein. 
Another way of lookirig at this would be for nucleic acid 
according to this aspect to be hybridisable with a nucleotide 
sequence complementary to any encoding sequence provided 

25 herein. Of course, DNA is generally double -stranded and 

blotting techniques such as Southern hybridisation are often 
performed following separation of the strands without a 
distinction being drawn between which of the strands is 
hybridising. Preferably the hybridisable nucleic acid or its 

30 complement encode a product able to influence a physical 
characteristic of a plant, particularly vernalization 
response, flowering time, leaf shape and/or shade avoidance 
response, e.g. in Arabidopsis thaliana. Preferred conditions 
for hybridisation are familiar to those skilled in the art, 

35 but are generally stringent enough for there to be positive 
hybridisation between the sequences of interest to the 
exclusion of other sequences . 
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The nucleic acid, which may contain for example DNA encoding a 
polypeptide including the amino acid sequence of VRN2 or other 
polypeptide disclosed herein, as genomic or cDNA, may be in 
the form of a recombinant and preferably replicable vector, 
5 for example a plasmid, cosmid, phage or Agrobacterium binary 

vector. The nucleic acid may be under the control of an 
appropriate promoter or other regulatory elements for 
expression in a host cell such as a microbial, e.g. bacterial, 
or plant cell. In the case of genomic DNA, this may contain 
10 its own promoter or other regulatory elements and in the case 
of cDNA this may be under the control of an appropriate 

% promoter or other regulatory elements for expression in the 

tti host cell. 

flO.5 A vector including nucleic acid according to the present 
Ji; invention need not include a promoter or other regulatory 
s sequence, particularly if the vector is to be used to 

introduce the nucleic acid into cells for recombination into 
y, the genome. 

rf Those skilled in the art are well able to construct vectors 
and design protocols for recombinant gene expression. 
Suitable vectors can be chosen or constructed, containing 
appropriate regulatory sequences, including promoter 
25 sequences, terminator fragments, polyadenylation sequences, 
enhancer sequences, marker genes and other sequences as 
appropriate. For further details see, for example, Molecular 

Cloning: a Laboratory Manual: 2nd edition, Sambrook et al, 

1989, Cold Spring Harbor Laboratory Press. Many known 
30 techniques and protocols for manipulation of nucleic acid, for 
example in preparation of nucleic acid constructs, 
mutagenesis, sequencing, introduction of DNA into cells and 
gene expression, and analysis of proteins, are described in 
detail in Current Protocols in Molecular Biology, Second 

35 Edition, Ausubel et al . eds . , John Wiley & Sons, 1992. The 

disclosures of Sambrook et al . and Ausubel et al . are 

incorporated herein by reference. Specific procedures and 
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vectors previously used with wide success upon plants are 
described by Bevan (Nucl. Acids Res. 12, 8711-8721 (1984)) and 
Guerineau and Mullineaux (1993) (Plant transformation and 
expression vectors. In: Plant Molecular Biology Labfax (Croy 
5 RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148) . 

Selectable genetic markers may be used consisting of chimaeric 
genes that confer selectable phenotypes such as resistance to 
antibiotics such as kanamycin, hygromycin, phosphinotricin, 
10 chlorsulfuron, methotrexate, gentamycin, spectinomycin, 
imidazolinones and glyphosate. 

Nucleic acid molecules and vectors according to the present 
invention may be provided isolated and/or purified from their 

15 natural environment, in substantially pure or homogeneous 

form, or free or substantially free of nucleic acid or genes 
of the species of interest or origin other than the sequence 
encoding a polypeptide with the required function. Nucleic 
acid according to the present invention may include cDNA, 

20 RNA r genomic DNA and may be wholly or partially synthetic. 

The term "isolate" encompasses all these possibilities. Where 
a DNA sequence is specified, e.g. with reference to a figure, 
unless context requires otherwise the RNA equivalent, with U 
substituted for T where it occurs, is encompassed. 

25 

When introducing a chosen gene construct into a cell, certain 
considerations must be taken into account, well known to those 
skilled in the art. The nucleic acid to be inserted should be 
assembled within a construct which contains effective 

3 0 regulatory elements which will drive transcription. There must 
be available a method of transporting the construct into the 
cell. Once the construct is within the cell membrane, 
integration into the endogenous chromosomal material either 
will or will not occur. Finally, as far as plants are 

3 5 concerned the target cell type must be such that cells can be 
regenerated into whole plants. 
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Plants transformed with the DNA segment containing the 
sequence may be produced by standard techniques which are 
already known for the genetic manipulation of plants. DNA can 
be transformed into plant cells using any suitable technology, 
such as a disarmed Ti-plasmid vector carried by Agrobacterium 
exploiting its natural gene transfer ability (EP-A-2703 55 , EP- 
A-0116718, NAR 12(22) 8711 - 87215 1984), particle or 
microprojectile bombardment {US 5100792, EP-A-444882, EP-A- 
434616) microinjection (WO 92/09696, WO 94/00583, EP 331083, 
EP 175966, Green et al . (1987) Plant Tissue and Cell Culture, 

Academic Press), electroporation (EP 290395, WO 8706614) other 
forms of direct DNA uptake (DE 4005152, WO 9012096, US 
4684611), liposome mediated DNA uptake (e.g. Freeman et al. 

Plant Cell Physiol. 29: 1353 (1984)), or the vortexing method 

{e.g. Kindle, PNAS U.S.A. 87: 1228 (1990d) Physical methods 

for the transformation of plant cells are reviewed in Oard, 
1991, Biotech. Adv. 9: 1-11. 

Agrobacterium transformation is widely used by those skilled 
in the art to transform dicotyledonous species . There are 
various approaches used for the routine production of stable, 
fertile transgenic plants in almost all economically relevant 
monocot plants (Toriyama, et al . (1988) Bio/Technology 6, 

1072-1074; Zhang, et al. (1988) Plant Cell Rep. 7 , 379-384; 

Zhang, et al. (1988) Theor Appl Genet 76, 835-840; Shimamoto, 

et al. (1989) Nature 338, 274-276; Datta, et al. (1990) 

Bio/Technology 8, 736-740; Christou, et al . (1991) 

Bio/Technology 9, 957-962; Peng, et al . (1991) International 

Rice Research Institute, Manila, Philippines 563-574; Cao, et 

al. (1992) Plant Cell Rep. 11, 585-591; Li, et al . (1993) 

Plant Cell Rep. 12, 250-255; Rathore, et al. (1993) Plant 

Molecular Biology 21, 871-884; Fromm, et al . (1990) 

Bio/Technology 8, 833-839; Gordon-Kamm, et al . (1990) Plant 

Cell 2, 603-618; D'Halluin, et al . (1992) Plant Cell 4, 1495- 

1505; Walters, et al . (1992) Plant Molecular Biology 18, 189- 
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200; Koziel, et al. {1993} Biotechnology 11, 194-200; Vasil, 
I. K. (1994) Plant Molecular Biology 25, 925-937; Weeks, et 
al. (1993) Plant Physiology 102, 1077-1084; Somers, et al . 
(1992) Eio/Techno2ogy 10 , 1589-1594; W092/14828). In 
5 particular, Agrobacterium mediated transformation is now 
emerging also as an highly efficient alternative 
transformation method in monocots (Hiei et al . (1994) The 
Plant Journal 6, 271-282) . 



10 The generation of fertile transgenic plants has been achieved 
in the cereals rice, maize, wheat, oat, and barley (reviewed 
in Shimamoto, K. (1994) Current Opinion in Biotechnology 5, 
158-162.; Vasil, et al. (1992) Bio/Technology 10, 667-674; 
Vain et al., 1995, Biotechnology Advances 13 (4): 653-671; 

15 Vasil, 1996, Nature Biotechnology 14 page 702). 

Microprojectile bombardment, electroporation and direct DNA 
uptake are preferred where Agrobacterium is inefficient or 
ineffective. Alternatively, a combination of different 
20 techniques may be employed to enhance the efficiency of the 
transformation process, eg bombardment with Agrobacterium 
coated microparticles (EP-A-486234) or microprojectile 
bombardment to induce wounding followed by co- cultivation with 
Agrobacterium (EP-A-486233) . 

25 

Following transformation, a plant may be regenerated, e.g. 
from single cells, callus tissue or leaf discs, as is standard 
in the art. Almost any plant can be entirely regenerated from 
cells, tissues and organs of the plant. Available techniques 
30 are reviewed in Vasil et al . , Cell Culture and Somatic Cell 
Genetics of Plants, Vol I, II and III, Laboratory Procedures 
and Their Application?, Academic Press, 1984, and Weissbach 
and Weissbach, Methods for Plant Molecular Biology, Academic 
Press, 1989. 
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The particular choice of a transformation technology will be 
determined by its efficiency to transform certain plant 
species as well as the experience and preference of the person 
practising the invention with a particular methodology of 
choice. It will be apparent to the skilled person that the 
particular choice of a transformation system to introduce 
nucleic acid into plant cells is not essential to or a 
limitation of the invention, nor is the choice of technique 
for plant regeneration. 

A VRN2 gene and modified versions thereof (alleles, mutants, 
variants and derivatives thereof) , and other nucleic acid 
provided herein, including species homologues, may be used to 
modify vernalization response, flowering time, leaf shape 
and/or shade avoidance response in a transgenic plant. 
Nucleic acid such as a vector as described herein may be used 
for the production of a transgenic plant. Such a plant may 
possess an altered phenotype, particular in terms of 
vernalization response, flowering time, leaf shape and/or 
shade avoidance response compared with wild-type (that is to 
say a plant that is wild-type for VRN2 or the relevant 
homologue thereof) . 

The invention further encompasses a host cell transformed with 
nucleic acid or a vector according to the present invention, 
especially a plant or a microbial cell. Thus, a host cell, 
such as a plant cell, including heterologous nucleic acid 
according to the present invention is provided. Within the 
cell, the nucleic acid may be incorporated within the 
chromosome. There may be more than one heterologous 
nucleotide sequence per haploid genome. 

Also according to the invention there is provided a plant cell 
having incorporated into its genome nucleic acid, particularly 
heterologous nucleic acid, as provided by the present 
invention, under operative control of a regulatory sequence 
for control of expression. The coding sequence may be 
operably linked to one or more regulatory sequences which may 
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be heterologous or foreign to the gene, such as not naturally 
associated with the gene for its expression. The nucleic acid 
according to the invention may be placed under the control of 
an externally inducible gene promoter to place expression 
5 under the control of the user. 

A suitable inducible promoter is the GST- II -27 gene promoter 
which has been shown to be induced by certain chemical 
compounds which can be applied to growing plants. The 

10 promoter is functional in both monocotyledons and 

dicotyledons. It can therefore be used to control gene 
expression in a variety of genetically modified plants, 
including field crops such as canola, sunflower, tobacco, 
sugarbeet, cotton; cereals such as wheat, barley, rice, maize, 

15 sorghum; fruit such as tomatoes, mangoes, peaches, apples, 

pears, strawberries, bananas, and melons; and vegetables such 
as carrot, lettuce, cabbage and onion. The GST- II -27 promoter 
is also suitable for use in a variety of tissues, including 
roots, leaves, stems and reproductive tissues. 

20 

A further aspect of the present invention provides a method of 
making such a plant cell involving introduction of nucleic 
acid or a suitable vector including the sequence of 
nucleotides into a plant cell and causing or allowing 
25 recombination between the vector and the plant cell genome to 
introduce the sequence of nucleotides into the genome. The 
invention extends to plant cells containing nucleic acid 
according to the invention as a result of introduction of the 
nucleic acid into an ancestor cell. 

30 

The term "heterologous" - may be used to indicate that the 
gene/sequence of nucleotides in question have been introduced 
into said cells of the plant or an ancestor thereof, using 
genetic engineering, ie by human intervention. A transgenic 
35 plant cell, i.e. transgenic for the nucleic acid in question, 
may be provided. The transgene may be on an extra-genomic 
vector or incorporated, preferably stably, into the genome. A 
heterologous gene may replace an endogenous equivalent gene, 
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ie one which normally performs the same or a similar function, 
or the inserted sequence may be additional to the endogenous 
gene or other sequence. An advantage of introduction of a 
heterologous gene is the ability to place expression of a 
5 sequence under the control of a promoter of choice, in order 
to be able to influence expression according to preference. 
Furthermore, mutants, variants and derivatives of the wild- 
type gene, e.g. with higher or lower activity than wild- type, 
may be used in place of the endogenous gene. Nucleic acid 

10 heterologous, or exogenous or foreign, to a plant cell may be 
non-naturally occuring in cells of that type, variety or 
species. Thus, nucleic acid may include a coding sequence of 
or derived from a particular type of plant cell or species or 
variety of plant, placed within the context of a plant cell of 

15 a different type or species or variety of plant. A further 

possibility is for a nucleic acid sequence to be placed within 
a cell in which it or a homologue is found naturally, but 
wherein the nucleic acid sequence is linked and/or adjacent to 
nucleic acid which does not occur naturally within the cell, 

20 or cells of that type or species or variety of plant, such as 
operably linked to one or more regulatory sequences, such as a 
promoter sequence, for control of expression. A sequence 
within a plant or other host cell may be identif iably 
heterologous, exogenous or foreign. 

25 

Plants which include a plant cell according to the invention 
are also provided, along with any part or propagule thereof, 
seed, selfed or hybrid progeny and descendants. A plant 
according to the present invention may be one which does not 

30 breed true in one or more properties. Plant varieties may be 
excluded, particularly registrable plant varieties according 
to Plant Breeders' Rights. It is noted that a plant need not 
be considered a "plant variety" simply because it contains 
stably within its genome a transgene, introduced into a cell 

3 5 of the plant or an ancestor thereof. 

In addition to a plant, the present invention provides any 
clone of such a plant, seed, selfed or hybrid progeny and 
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descendants, and any part of any of these, such as cuttings, 
seed. The invention provides any plant propagule, that is any 
part which may be used in reproduction or propagation, sexual 
or asexual, including cuttings, seed and so on. Also 
5 encompassed by the invention is a plant which is a sexually or 
asexually propagated off -spring, clone or descendant of such a 
plant, or any part or propagule of said plant, off -spring, 
clone or descendant. 



10 The invention further provides a method of influencing or 

affecting a physical characteristic of a plant, particularly 
vernalization response, flowering time, leaf shape and/or 
shade avoidance response, including causing or allowing 
expression of a heterologous nucleic acid sequence as 

15 discussed within cells of the plant. 

The invention further provides a method of inducing expression 
from nucleic acid encoding a VRN2 polypeptide, or a mutant, 
variant, allele or derivative of the sequence, or a homologue, 

20 according to the disclosure herein, within cells of a plant 
(thereby producing the encoded polypeptide) , following an 
earlier step of introduction of the nucleic acid into a cell 
of the plant or an ancestor thereof. Such a method may 
influence or affect a characteristic of the plant, such as 

25 vernalization response, flowering time, leaf shape and/or 
shade avoidance response. This may be used in combination 
with any other gene, such as transgenes involved in flowering 
(e,g. FCA) or other phenotypic trait or desirable property. 



30 The present invention also encompasses the expression product 
of any of the nucleic acid sequences disclosed and methods of 
making the expression product by expression from encoding 
nucleic acid therefore under suitable conditions, which may be 
in suitable host cells. Following expression, the product may 

35 be isolated from the expression system and may be used as 
desired, for instance in formulation of a composition 
including at least one additional component. 
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The present invention also provides for the production and use 
of fragments of the full-length polypeptides disclosed herein, 
especially active portions thereof. An "active portion" of a 
polypeptide means a peptide which is less than said full 
5 length polypeptide, but which retains an essential biological 
activity. In particular, the active portion retains the 
ability to alter vernalization response, flowering time, leaf 
shape and/or shade avoidance response in a plant, such as 
Arabidopsis tha.li ana . 

10 

A "fragment" of a polypeptide means a stretch of amino acid 
residues of at least about five to seven contiguous amino 
acids, often. at least about seven to nine contiguous amino 
acids, typically at least about nine to 13 contiguous amino 

■15 acids and, most preferably, at least about 2 0 to 3 0 or more 
contiguous amino acids. Fragments of the polypeptides may 
include one or more epitopes useful for raising antibodies to 
a portion of any of the amino acid sequences disclosed herein. 
Preferred epitopes are those to which antibodies are able to 

;20 bind specifically, which may be taken to be binding a 

polypeptide or fragment thereof of the invention with an 
affinity which is at least about lOOOx that of other 
polypeptides . 

25 Among preferred VRN2 fragments according to the present 

invention are the zinc finger domain, DNA binding domain and 
other domains disclosed herein. 

Purified protein according to the present invention, or a 
30 fragment, mutant, derivative or variant thereof, e.g. produced 
recombinantly by expression from encoding nucleic acid 
therefor, may be used to raise antibodies employing techniques 
which are standard in the art. Antibodies and polypeptides 
comprising antigen- binding fragments of antibodies may be used 
35 in identifying homologues from other species as discussed 

further below, also in identifying complexes containing VRN2 
protein. 
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Methods of producing antibodies include immunising a mammal 
(e.g. human, mouse, rat, rabbit, horse, goat, sheep or monkey) 
with the protein or a fragment thereof. Antibodies may be 
obtained from immunised animals using any of a variety of 
5 techniques known in the art, and might be screened, preferably 
using binding of antibody to antigen of interest. For 
instance, Western blotting techniques or immunoprecipitation 
may be used (Armitage et al, 1992, Nature 357: 80-82). 
Antibodies may be polyclonal or monoclonal. 

10 

As an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be 
obtained from a recombinantly produced library of expressed 
immunoglobulin variable domains, e.g. using lambda 
15 bacteriophage or filamentous bacteriophage which display 

functional immunoglobulin binding domains on their surfaces; 
for instance see WO92/01047. 

Antibodies raised to a polypeptide or peptide can be used in 

20 the identification and/or isolation of homologous 

polypeptides, and then the encoding genes. Thus, the present 
invention provides a % method of identifying or isolating a 
polypeptide with the desired function (in accordance with 
embodiments disclosed herein) , comprising screening candidate 

25 polypeptides with a polypeptide comprising the antigen-binding 
domain of an antibody (for example whole antibody or a 
suitable fragment thereof, e.g. scFv, Fab) which is able to 
bind a polypeptide or fragment, variant or derivative thereof 
according to the present invention or preferably has binding 

30 specificity for such a polypeptide. Specific binding members 
such as antibodies and polypeptides comprising antigen binding 
domains of antibodies that bind and are preferably specific 
for a polypeptide or mutant, variant or derivative thereof 
according to the invention represent further aspects of the 

35 present invention, particularly in isolated and/or purified 
form, as do their use and methods which employ them. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/44918 PCT/GB00/00248 

23 

Candidate polypeptides for screening may for instance be the 
products of an expression library created using nucleic acid 
derived from an plant of interest, or may be the product of a 
purification process from a natural source, A polypeptide 
found to bind the antibody may be isolated and then may be 
subject to amino acid sequencing. Any suitable technique may 
be used to sequence the polypeptide either wholly or partially 
(for instance a fragment of the polypeptide may be sequenced) . 
Amino acid sequence information may be used in obtaining 
nucleic acid encoding the polypeptide, for instance by 
designing one or more oligonucleotides (e,g, a degenerate pool 
of oligonucleotides) for use as probes or primers in 
hybridization to candidate nucleic acid, or by searching 
computer sequence databases, as discussed further below, 

A further aspect of the present invention provides a method of 
identifying and cloning VRN2 homologues from plant species 

other than Arabidopsis tha.lia.na which method employs a 

nucleotide sequence derived from that shown herein. As 
discussed above, sequences derived from these may themselves 
be used in identifying and in cloning other sequences. The 
nucleotide sequence information provided herein, or any part 
thereof, may be used in a data-base search to find homologous 
sequences, expression products of which can be tested for 
ability to influence a plant characteristic. These may have 
ability to alter vernalization response, flowering time, leaf 
shape and/or shade avoidance response in a plant. 
Alternatively, nucleic acid libraries may be screened using 
techniques well known to those skilled in the art and 
homologous sequences thereby identified then tested. 

The present invention also extends to nucleic acid encoding a 
VRN2 homologue obtained using a nucleotide sequence derived 
from any of those shown herein. 

In certain embodiments, nucleic acid according to the present 
invention encodes a polypeptide which has homology with all or 
part of VRN2 amino acid sequence shown herein, in the terms 
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discussed already above (e.g. for length) , which homology is 
greater over the length of the relevant part (i.e. fragment) 
than the homology shared between a respective part of the VRN2 
amino acid sequence of Arabidopsis tha.lia.na, and the other 

5 sequences shown in Figure 8a or Figure 8b, and may be greater 
than about 5% greater, more preferably greater than about 10% 
greater, more preferably greater than about 20% greater, and 
more preferably greater than about 30% greater. Thus, to 
exemplify with reference to one embodiment , nucleic acid 

10 encoding an amino acid mutant, variant or derivative of the 
amino acid sequence shown in SEQ ID NO: 2 may be provided 
wherein the encoded amino acid sequence includes a contiguous 
sequence of about 100 amino acids which has greater homology 
with a contiguous sequence of 100 amino acids within the amino 

15 acid sequence of SEQ ID NO: 2 than any contiguous sequence of 
100 amino acids within another sequence shown in Figure 8a or 
8b , preferably greater than about 5% greater homology, and so 
on. 

20 Similarly, nucleic acid according to certain embodiments of 
the present invention may have homology with all or part of a 
nucleotide sequence shown herein, in the terms discussed 
already above (e.g. for length) , which homology is greater 
over the length of the relevant part (i.e. fragment) than the 

25 homology shared with a respective part of the natural coding 
nucleotide sequence for the other amino acid sequences shown 
in Figure 8a or 8b and referenced herein, and may be greater 
than about 5% greater, more preferably greater than about 10% 
greater, more preferably greater than about 20% greater, and 

30 more preferably greater than about 30% greater. 

The provision of sequence information for the VRN2 gene of 
Arabidopsis thaliana enables the obtention of homologous 
sequences from other plant species. In particular, 
35 homologues may be easily isolated from related, commercially 
important species that have a vernalization requirement, or 
show some response to vernalization. These would include all 
members of the Brassicaceae , and other dicots including 
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tobacco, sugarbeet, peas and celery. Monocots included in 
this category are the cereals rice, wheat and barley. 

Thus, included within the scope of the present invention are 
5 nucleic acid molecules which encode amino acid sequences which 
are homologues of VRN2 of Arabidopsis thalia.ua. Homology may 

be at the nucleotide sequence and/or amino acid sequence 
level, as has already been discussed above. A homologue from 
a species other than Arabidopsis thaliana encodes a product 

10 which causes a phenotype similar to that caused by the VRW2 

gene, generally including ability to alter vernalization 
response, flowering time, leaf shape and/or shade avoidance 
response in a plant, such as in Arabidopsis thaliana. In 

addition, mutants, derivatives or alleles of these genes may 
;15 have altered, e.g. increased or decreased, activity or ability 
compared with wild- type. 

VRN2 gene homologues may also be identified from economically 
important monocotyledonous crop plants including the cereals 
[20 rice, wheat and barley. Although genes encoding the same 
protein in monocotyledonous and dicotyledonous plants show 
relatively little homology at the nucleotide level, amino acid 
sequences are conserved. Therefore it is possible to use 
public sequence databases to identify Arabidopsis, rice or 

25 maize cDNA clone sequences that were obtained in random 
sequencing programmes and share homology to the gene of 
interest, as has been done for other genes isolated from 
Arabidopsis (e.g CO; WO 96/14414) . Of course, mutants, 
derivatives and alleles of these sequences are included within 

3 0 the scope of the present invention in the same terms as 
discussed above for the Arabidopsis thaliana VRN2 gene. 

According to a further aspect, the present invention provides 
a method of identifying or a method of cloning a VRN2 
35 homologue, e.g. from a species other than Arabidopsis thaliana 
the method employing a nucleotide sequence derived from any of 
those shown herein. For instance, such a method may employ an 
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oligonucleotide or oligonucleotides which comprises or 
consists of a sequence or sequences conserved between or 
encoding a sequence or sequences conserved between the 
sequences shown in Figure 8a or 8b, or a sequence or sequences 
conserved between the sequences of SEQ ID NO: 2 and SEQ ID NO: 
5, or encoding sequences SEQ ID NO: 1 and SEQ ID NO: 4, to 
search for homologues . Thus, a method of obtaining nucleic 
acid is provided, comprising hybridisation of an 
oligonucleotide or a nucleic acid molecule comprising such an 
oligonucleotide to target/candidate nucleic acid. Target or 
candidate nucleic acid may, for example, comprise a genomic or 
cDNA library obtainable from an organism known to contain or 
suspected of containing such nucleic acid, either 
monocotyledonous or dicotyledonous- Successful hybridisation 
may be identified and target /candidate nucleic acid isolated 
for further investigation and/or use. 

Hybridisation may involve probing nucleic acid and identifying 
positive hybridisation under suitably stringent conditions (in 
accordance with known techniques) and/or use of 
oligonucleotides as primers in a method of nucleic acid 
amplification, such as PCR. For probing, preferred conditions 
are those which are stringent enough for there to be a simple 
pattern with a small number of hybridisations identified as 
positive which can be investigated further. It is well known 
in the art to increase stringency of hybridisation gradually 
until only a few positive clones remain* 

For instance, screening may initially be carried out under 
conditions, which comprise a temperature of about 37°C or 
more, a formamide concentration of less than about 50%, and a 
moderate to low salt (e.g. Standard Saline Citrate ("SSC 1 ) = 
0.15 M sodium chloride; 0.15 M sodium citrate; pH 7) 
concent rat ion . 

Alternatively, a temperature of about 50°C or more and a high 
salt (e.g. "SSPE'= 0.180 M sodium chloride; 9 mM disodium 
hydrogen phosphate; 9 mM sodium dihydrogen phosphate; 1 mM 
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sodium EDTA; pH 7.4} . Preferably the screening is carried out 
at about 37°C / a formamide concentration of about 20%, and a 
salt concentration of about 5 X SSC, or a temperature of about 
50°C and a salt concentration of about 2 X SSPE. These 
5 conditions will allow the identification of sequences which 
have a substantial degree of homology (similarity, identity) 
with the probe sequence, without requiring the perfect 
homology for the identification of a stable hybrid. 

10 Suitable conditions include, e.g. for detection of sequences 
that are about 80-90% identical, hybridization overnight at 
42°C in 0.25M Na 2 HP0 4/ pH 7.2, 6.5% SDS, 10% dextran sulfate 
and a final wash at 55°C in 0.1X SSC, 0.1% SDS. For detection 
of sequences that are greater than about 9 0% identical, 

15 suitable conditions include hybridization overnight at 65 °C in 
0.25M Na 2 HP0 4 , pH 7.2, 6.5% SDS, 10% dextran sulfate and a 
final wash at 60°C in 0.1X SSC, 0.1% SDS. 

An alternative is a solution of 5x SSPE {final 0.9 M NaCl, 
20 0.05M sodium phosphate, 0.005M ethylenediamnetetraacetic acid 
(EDTA) pH 7.7), 5X Denhardt 1 s solution, 0.5% SDS {sodium 
dodecyl sulphate), at 65°C overnight, (for high stringency, 
highly similar sequences) or 50 °C {for low stringency, less 
similar sequences). Washes in 0 . 2x SSC/0.1% SDS at 65°C for 
25 high stringency, alternatively at 50-60°C in lx SSC/0.1%SDS 
for low stringency. 

The present invention extends to nucleic acid selectively 
hybridisable under high stringency with nucleic acid 
30 identified herein. 

As an alternative to probing, though still employing nucleic 
acid hybridisation, oligonucleotides designed to amplify DNA 
sequences may be used in PCR reactions or other methods 
35 involving amplification of nucleic acid, using routine 

procedures. See for instance "PCR protocols; A Guide to 
Methods and Applications", Eds. Innis et al, 1990, Academic 
Press, New York. 
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Preferred amino acid sequences suitable for use in the design 
of probes or PCR primers for some purposes are sequences 
conserved (completely, substantially or partly) between the 
VRN2 sequence and at least one other of the sequences shown in 

5 Figure 8a or 8b. 

Preferred primers for amplification of conserved regions of 
VRN2 for use as probes to obtain genomic or cDNA clones may 

include the following: 

10 

Primers VRN2-AI and VRN2-AJ which, in RT-PCR, amplify a 1583 
bp fragment that contains the complete VRN2 open reading 
frame, and portions of both the 5* and 3* untranslated 
sequences ; 

15 

Primers VRN2-AP and VRN2-AJ which, in RT-PCR, amplify a 781 bp 
fragment that includes the conserved acidic region; 

Primers VRN2-AO and VRN2-AS which, in RT-PCR, amplify a 493 bp 
20 fragment that includes the zinc-finger motif, and the second 
nuclear localization signal (NLS) ; and 

Primers VRN2-AI and VRN2-AJ from genomic DNA, which amplify a 
3 605 bp product that includes most of the VRN2 gene, except 
25 the promoter and 3 1 regions (i.e. encompasses the same regions 
as the VRN2-AI/AJ pair above, but with the introns, useful for 
hybridisation to genomic DNA, less so for cDNA) . 

On the basis of amino acid sequence information 
30 oligonucleotide probes or primers may be designed, taking into 
account the degeneracy of the genetic code, and, where 
appropriate, codon usage of the organism from which the 
candidate nucleic acid is derived, 

3 5 Preferably an oligonucleotide in accordance with certain 
embodiments of the invention, e.g. for use in nucleic acid 
amplification, is up to about 50 nucleotides, or about 4 0 



SUBSTITUTE SHEET (RULE 26) 



WO 00/44918 PCT/GBOO/00248 

29 

nucleotides or about 30 or fewer nucleotides in length (e.g. 
18, 21 or 24) . 

Assessment of whether or not such a PCR product corresponds to 
a homologue gene may be conducted in various ways* A PCR band 
from such a reaction might contain a complex mix of products. 
Individual products may be cloned and each one individually 
screened. It may be analysed by transformation to assess 
function on introduction into a plant of interest. 

As noted, nucleic acid according to the present invention is 
obtainable using oligonucleotides, designed on the basis of 
sequence information provided herein, as probes or primers. 
Nucleic acid isolated and/or purified from one or more cells 
of a plant (see above) , or a nucleic acid library derived from 
nucleic acid isolated and/or purified from the plant (e.g. a 
cDNA library derived from mRNA isolated from the plant) , may 
be probed under conditions for selective hybridisation and/or 
subjected to a specific nucleic acid amplification reaction 
such as the polymerase chain reaction (PCR) . The nucleic acid 
probed or used as template in the amplification reaction may 
be genomic DNA, cDNA or RNA. If necessary, one or more gene 
fragments may be ligated to generate a full-length coding 
sequence . 

PCR primers derived from the VRN2 sequences disclosed herein 
may readily be tested for their specificity for amplifying 
nucleic acid according to the present invention, using both 
genomic DNA and RT-PCR templates. Cloning and subsequent 
sequencing of PCR products may be used to indicate 
amplification of the expected derived gene fragment. Full 
length cDNA clones can be obtained as described by 5« and 3 r 
RACE technology if RT-PCR products are used as templates. 

Various aspects of the present invention include the 
obtainable nucleic acid, methods of screening material, e.g. 
cell lysate, nucleic acid preparations, for the presence of 
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nucleic acid of interest, methods of obtaining the nucleic 
acid, and suitable primers and primer combinations. 

The sequence information provided herein also allows the 
design of diagnostic tests for determination of the presence 
of a specific gene or allele thereof in any given plant, 
cultivar, variety, population, landrace, part of a family or 
other selection in a breeding programme or other such 
genotype. A diagnostic test may be based on determination of 
the presence or absence of a particular allele by means of 
nucleic acid or polypeptide determination. 

At the nucleic acid level, this may involve hybridisation of a 
suitable oligo- or poly-nucleotide, such as a fragment of the 
gene or a homologue thereof, including any homologue disclosed 
herein, or any particular allele, such as an allele which 
gives a desirable phenotype, such as any such allele disclosed 
herein. The hybridisation may involve PCR designed to amplify 
a product from a given allelic version of the gene, with 
subsequent detection of an amplified product by any of a 
number of possible methods including but not limited to gel 
electrophoresis, capillary electrophoresis, direct 
hybridisation of nucleotide sequence probes and so on. A 
diagnostic test may be based on PCR designed to amplify 
various alleles or any allele from the relevant locus, with a 
test to distinguish the different possible alleles by any of a 
number of possible methods, including DNA fragment size, 
restriction site variation (e.g. CAPS - cleaved amplified 
polymorphic sites) and so on. A diagnostic test may also be 
based on a great number of possible variants of nucleic acid 
analysis that will be apparent to those skilled in the art, 
such as use of a synthetic sequence as a hybridisation probe. 

Broadly, the methods divide into those screening for the 
presence of nucleic acid sequences and those that rely on 
detecting the presence or absence of a polypeptide. The 
methods may make use of biological samples from one or more 
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plants or cells that are suspected to contain the nucleic acid 
sequences or polypeptide. 

Exemplary approaches for detecting nucleic acid or 
5 polypeptides include analysing a sample from the plant or 
plant cell by: 

(a) comparing the sequence of nucleic acid in the sample with 
all or part of a nucleotide sequence shown herein, to 

10 determine whether the sample contains a mutation; 

(b) determining the presence in the sample of a polypeptide 
including a VRN2 amino acid sequence shown herein, or a 
fragment thereof and, if present, determining whether the 

15 polypeptide is full length, and/or is mutated, and/or is 
expressed at the normal level; 

(c) performing DMA fingerprinting to compare the restriction 
pattern produced when a restriction enzyme cuts nucleic acid 

20 in the sample with the restriction pattern obtained from a 
nucleotide sequence shown herein, or from a known mutant, 
allele or variant thereof; 

(d) contacting the sample with a specific binding member 

25 capable of binding to nucleic acid including the nucleotide 
sequence as set out herein, or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 
including nucleic acid hybridisable with a VRN2 sequence 
herein, or a polypeptide including a binding domain with 
specificity for nucleic acid including a VRN2 sequence or 
polypeptide encoded by it, or a mutated form thereof, and 
determining binding of the specific binding member ; 
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(e) performing PCR involving one or more primers based on a 
nucleotide sequence shown herein to screen the sample for 
nucleic acid including the nucleotide sequence of SEQ ID NO: 1 
or SEQ ID NO: 4 or a mutant, allele or variant thereof. 
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When screening for a VRN2 allele nucleic acid, the nucleic 

acid in the sample will initially be amplified, e.g. using 
PCR, to increase the amount of the analyte as compared to 
other sequences present in the sample. This allows the target 
sequences to be detected with a high degree of sensitivity if 
they are present in the sample. This initial step may be 
avoided by using highly sensitive array techniques that are 
becoming increasingly important in the art. 

A variant form of the gene may contain one or more insertions, 
deletions, substitutions and/or additions of one or more 
nucleotides compared with the wild-type sequence which may or 
may not disrupt or alter the gene function. Differences at 
the nucleic acid level are not necessarily reflected by a 
difference in the amino acid sequence of the encoded 
polypeptide. However, a mutation or other difference in a 
gene may result in a frame- shift or stop codon, which could 
seriously affect the nature of the polypeptide produced (if 
any) , or a point mutation or gross mutational change to the 
encoded polypeptide, including insertion, deletion, 
substitution and/or addition of one or more amino acids or 
regions in the polypeptide. A mutation in a promoter sequence 
or other regulatory region may prevent or reduce expression 
from the gene or affect the processing or stability of the 
mRNA transcript . 

Tests may be carried out on preparations containing genomic 
DNA, cDNA and/or mRNA. Testing cDNA or mRNA has the advantage 
of the complexity of the nucleic acid being reduced by the 
absence of intron sequences, but the possible disadvantage of 
extra time and effort being required in making the 
preparations. RNA is more difficult to manipulate than DNA 
because of the wide -spread occurrence of RN'ases. 

Nucleic acid in a test sample may be sequenced and the 
sequence compared with a sequence shown herein to determine 
whether or not a difference is present. If so, the difference 
can be compared with known alleles to determine whether the 
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test nucleic acid contains one or more of the variations 
indicated, or the difference can be investigated for 
association with a desired phenotype. 

5 The amplified nucleic acid may then be sequenced as above, 
and/or tested in any other way to determine the presence or 
absence of a particular feature. Nucleic acid for testing may 
be prepared from nucleic acid removed from cells or in a 
library using a variety of other techniques such as 
10 restriction enzyme digest and -electrophoresis . 

Nucleic acid may be screened using a variant- or allele- 
m specific probe. Such a probe corresponds in sequence to a 
5r region of the gene, or its complement, containing a sequence 
j?;15 alteration known to be associated with alteration of ability 
ji| to affect vernalization response, flowering time, leaf shape 
J~ and/or shade avoidance response. Under suitably stringent 
H- conditions, specific hybridisation of such a probe to test 
jT nucleic acid is indicative of the presence of the sequence 
i|20 alteration in the test nucleic acid. For efficient screening 
:J purposes, more than one probe may be used on the same test 

sample* 

Allele- or variant -specific oligonucleotides may similarly be 
25 used in PCR to specifically amplify particular sequences if 
present in a test sample. Assessment of whether a PCR band 
contains a gene variant may be carried out in a number of ways 
familiar to those skilled in the art. The PCR product may for 
instance be treated in a way that enables one to display the 
3 0 mutation or polymorphism on a denaturing polyacrylamide DNA 
sequencing gel, with specific bands that are linked to the 
gene variants being selected. 

An alternative or supplement to looking for the presence of 
35 variant sequences in a test sample is to look for the presence 
of the normal sequence, e.g. using a suitably specific 
oligonucleotide probe or primer. 
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Approaches which rely on hybridisation between a probe and 
test nucleic acid and subsequent detection of a mismatch may 
be employed. Under appropriate conditions (temperature, pH 
etc.), an oligonucleotide probe will hybridise with a sequence 
5 which is not entirely complementary. The degree of base- 
pairing between the two molecules will be sufficient for them 
to anneal despite a mis-match. Various approaches are well 
known in the art for detecting the presence of a mis -match 
between two annealing nucleic acid molecules. 

10 

For instance, RN'ase A cleaves at the site of a mis -match. 
Cleavage can be detected by electrophoresing test nucleic acid 
to which the relevant probe or probe has annealed and looking 
for smaller molecules (i.e. molecules with higher 
15 electrophoretic mobility) than the full length probe/test 

hybrid. Other approaches rely on the use of enzymes such as 
resolvases or endonucleases . 

Thus, an oligonucleotide probe that has the sequence of a 
20 region of the normal gene (either sense or anti-sense strand) 
in which mutations associated with particular phenotypes are 
known to occur may be* annealed to test nucleic acid and the 
presence or absence of a mis -match determined. Detection of 
the presence of a mis -match may indicate the presence in the 
25 test nucleic acid of a mutation. On the other hand, an 

oligonucleotide probe that has the sequence of a region of the 
gene including a mutation may be annealed to test nucleic acid 
and the presence or absence of a mis -match determined. The 
presence of a mis-match may indicate that the nucleic acid in 
3 0 the test sample has the normal sequence, or a different mutant 
or allele sequence. In either case, a battery of probes to 
different regions of the gene may be employed. 

The presence of differences in sequence of nucleic acid 
3 5 molecules may be detected by means of restriction enzyme 

digestion, such as in a method of DNA fingerprinting where the 
restriction pattern produced when one or more restriction 
enzymes are used to cut a sample of nucleic acid is compared 
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with the pattern obtained when a sample containing the normal 
gene or a variant or allele is digested with the same enzyme 
or enzymes. 

5 The presence of absence of a lesion in a promoter or other 
regulatory sequence may also be assessed by determining the 
level of mRNA production by transcription or the level of 
polypeptide production by translation from the mRNA. 

10 Nucleic acid isolated and/or purified from one or more cells 
of a plant or a nucleic acid library derived from nucleic acid 
isolated and/or purified from cells (e.g. a cDNA library 
derived from mRNA isolated from the cells) , may be probed 
under conditions for selective hybridisation and/or subjected 

15 to a specific nucleic acid amplification reaction such as the 
polymerase chain reaction (PCR) . 

A method may include hybridisation of one or more (e.g. two) 
probes or primers to target nucleic acid. Where the nucleic 

20 acid is double- stranded DMA, hybridisation will generally be 
preceded by denaturation to produce single-stranded DNA. The 
hybridisation may be as part of a PCR procedure, or as part of 
a probing procedure not involving PCR. An example procedure 
would be a combination of PCR and low stringency 

25 hybridisation. A screening procedure, chosen from the many 
available to those skilled in the art, is used to identify 
successful hybridisation events and isolate hybridised nucleic 
acid. 



30 



Binding of a probe to target nucleic acid (e.g. DNA) may be 
measured using any of a variety of techniques at the disposal 
of those skilled in the art. For instance, probes may be 
radioactively, f luorescently or enzymatically labelled. Other 
methods not employing labelling of probe include examination 
35 of restriction fragment length polymorphisms, amplification 

using PCR, RNAase cleavage and allele specific oligonucleotide 
probing. 
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Probing may employ the standard Southern blotting technique. 
For instance DNA may be extracted from cells and digested with 
different restriction enzymes. Restriction fragments may then 
be separated by electrophoresis on an agarose gel, before 
5 denaturation and transfer to a nitrocellulose filter. 

Labelled probe may be hybridised to the DNA fragments on the 
filter and binding determined. DNA for probing may be 
prepared from RNA preparations from cells. 

10 Preliminary experiments may be performed by hybridising under 
low stringency conditions various probes to Southern blots of 
«*, DNA digested with restriction enzymes. Suitable conditions 

'hex 

fii would be achieved when a large number of hybridising fragments 

were obtained while the background hybridisation was low. 
ri5 Using these conditions nucleic acid libraries, e.g. cDNA 
libraries representative of expressed sequences, may be 
searched. 

As noted, those skilled in the art are well able to employ 
suitable conditions of the desired stringency for selective 
hybridisation, taking into account factors such as 
oligonucleotide length and base composition, temperature and 
so on. 

25 In some preferred embodiments of diagnostic assays according 
to the present invention, oligonucleotides according to the 
present invention that are fragments of any of the sequences 
shown herein, or any allele associated with a desired 
phenotype are at least about 10 nucleotides in length, more 
3 0 preferably at least about 15 nucleotides in length, more 
preferably at least about 20 nucleotides in length, more 
preferably about 3 0 nucleotides in length. Such fragments 
themselves individually represent aspects of the present 
invention. Fragments and other oligonucleotides may be used 
3 5 as primers or probes as discussed but may also be generated 
(e.g. by PGR) in methods concerned with determining the 
presence in a test sample of a sequence indicative of a 
desired phenotype. 
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There are various methods for determining the presence or 
absence in a test sample of a particular polypeptide, such as 
a polypeptide including the amino acid sequence shown in SEQ 
ID NO: 2 or SEQ ID NO: 5, or an amino acid sequence mutant, 
variant or allele thereof. 

A sample may be tested for the presence of a binding partner 
for a specific binding member such as an antibody (or mixture 
of antibodies) , specific for one or more particular variants 
of a polypeptide shown herein. 

In such cases, the sample may be tested by being contacted 
with a specific binding member such as an antibody under 
appropriate conditions for specific binding, before binding is 
determined, for instance using a reporter system as discussed. 
Where a panel of antibodies is used, different reporting 
labels may be employed for each antibody so that binding of 
each can be determined. 

A specific binding member such as an antibody may be used to 
isolate and/or purify its binding partner polypeptide from a 
test sample, to allow for sequence and/or biochemical analysis 
of the polypeptide to determine whether it has the sequence 
and/or properties of the wild-type polypeptide or a particular 
mutant, variant or allele thereof. Amino acid sequence is 
routine in the art using automated sequencing machines . 

The use of diagnostic tests for alleles allows the researcher 
or plant breeder to establish, with full confidence and 
independent from time consuming biochemical tests, whether or 
not a desired allele is present in the plant of interest (or a 
cell thereof) , whether the plant is a representative of a 
collection of other genetically identical plants (e.g. an 
inbred variety or cultivar) or one individual in a sample of 
related (e.g. breeders* selection) or unrelated plants. 



WO 00/44918 PCT/CfBOO/00248^ 

38 

In a breeding scheme based on selection and selfing of 
desirable individuals, nucleic acid or polypeptide diagnostics 
for the desirable allele or alleles in high throughput, low 
cost assays as provided by this invention, reliable selection 
5 for the can be made at early generations and on more material 
than would otherwise be possible. This gain in reliability of 
selection plus the time saving by being able to test material 
earlier and without costly phenotype screening is of 
considerable value in plant breeding. 

10 

Nucleic acid-based determination of the presence or absence of 
one or more desirable alleles may be combined with 
determination of the genotype of the flanking linked genomic 
DNA and other unlinked genomic DNA using established sets of 

15 markers such as RFLPs, microsatellites or SSRs, AFLPs, RAPDs 
etc. This enables the researcher or plant breeder to select 
for not only the presence of the desirable allele but also for 
individual plant or families of plants which have the most 
desirable combinations of linked and unlinked genetic 

20 background. Such recombinations of desirable material may 
occur only rarely within a given segregating breeding 
population or backcross progeny. Direct assay of the locus as 
afforded by the present invention allows the researcher to 
make a stepwise approach to fixing (making homozygous) the 

25 desired combination of flanking markers and alleles, by first 
identifying individuals fixed for one flanking marker and then 
identifying progeny fixed on the other side of the locus all 
the time knowing with confidence that the desirable allele is 
still present . 

30 

The present disclosure provides sufficient information for a 
person skilled in the art to obtain genomic DNA sequence for 
any given new or existing allele and devise a suitable nucleic 
acid- and/or polypeptide-based diagnostic assay. In designing 
3 5 a nucleic acid assay account is taken of the distinctive 
variation in sequence that characterises the particular 
variant allele . 
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Nucleic acid according to the invention may include a 
nucleotide sequence encoding a product involved in 
vernalization response, flowering time, leaf shape and/or 
shade avoidance response. Reducing or increasing the level of 
5 expression may be used to manipulate such a characteristic in 
a plant. This may involve use of anti-sense or sense 
regulation, discussed further below. 

Nucleic acid according to the invention, such as a VRN2 gene 
10 or homologue, may be placed under the control of an externally 
inducible gene promoter to place expression under the control 
of the user. An advantage of introduction of a heterologous 
gene into a plant cell, particularly when the cell is 
comprised in a plant, is the ability to place expression of 
15 the gene under the control of a promoter of choice, in order 
to be able to influence gene expression, and therefore 
vernalization response, flowering time, leaf shape, shade 
avoidance response, and/or other characteristic, according to 
preference* Furthermore, mutants and derivatives of the 
20 wild- type gene, e.g. with higher or lower activity than 
wild- type, may be use t d in place of the endogenous gene. 

In the present invention, over-expression may be achieved by 
introduction of the nucleotide sequence in a sense 

25 orientation. Thus, the present invention provides a method of 
influencing a physical characteristic of a plant, the method 
including causing or allowing expression of the product 
{polypeptide or nucleic acid transcript) encoded by 
heterologous nucleic acid according to the invention from that 

30 nucleic acid within cells of the plant. 

Down-regulation of expression of a target gene may be achieved 
using anti-sense technology or "sense regulation" ("co- 
suppression") . 



In using anti -sense genes or partial gene sequences to 
down-regulate gene expression, a nucleotide sequence is placed 
under the control of a promoter in a "reverse orientation" 
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such that transcription yields RNA which is complementary to 
normal mRNA transcribed from the "sense" strand of the target 
gene. See, for example, Rothstein et al, 1987; Smith et 
al, (1988) Nature 334, 724-726; Zhang et al , (1992) The Plant 
Cell 4, 1575-1588, English et al., (1996) The Plant Cell 8, 
179-188. Antisense technology is also reviewed in Bourque, 
(1995), Plant Science 105, 125-149, and Flavell, (1994) PNAS 
USA 91, 3490-3496. 



An alternative is to use a copy of all or part of the target 
gene inserted in sense, that is the same, orientation as the 
target gene, to achieve reduction in expression of the target 
gene by co- suppression. See, for example, van der Krol et 
al., (1990) The Plant Cell 2, 291-299; Napoli et al., (1990) 
15 The Plant Cell 2, 279-289; Zhang et al . , (1992) The Plant Cell 
4, 1575-1588, and US-A-5 , 231 , 020 . 

The complete sequence corresponding to the coding sequence (in 
reverse orientation for ant i- sense) need not be used. For 

20 example fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
fragments of various sizes and from various parts of the 
coding sequence to optimise the level of anti-sense 
inhibition. It may be advantageous to include the initiating 

25 methionine ATG codon, and perhaps one or more nucleotides 

upstream of the initiating codon. A further possibility is to 
target a conserved sequence of a gene, e.g. a sequence that is 
characteristic of one or more genes, such as a regulatory 
sequence . 

30 

The sequence employed may be about 500 nucleotides or less, 
possibly about 400 nucleotides, about 300 nucleotides, about 
200 nucleotides, or about 100 nucleotides. It may be possible 
to use oligonucleotides of much shorter lengths, 14-23 
35 nucleotides, although longer fragments, and generally even 
longer than about 500 nucleotides are preferable where 
possible, such as longer than about 600 nucleotides, than 
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about 700 nucleotides, than about 800 nucleotides, than about 
100 0 nucleotides or more. 

It may be preferable that there is complete sequence identity 
5 in the sequence used for down-regulation of expression of a 

target sequence, and the target sequence, though total 

complementarity or similarity of sequence is not essential. 

One or more nucleotides may differ in the sequence used from 

the target gene. Thus, a sequence employed in a down- 
10 regulation of gene expression in accordance with the present 

invention may be a wild-type sequence (e.g. gene) selected 
% from those available, or a mutant, derivative, variant or 
l:p allele, by way of insertion, addition, deletion or 
:5r substitution of one or more nucleotides, of such a sequence. 
ff.15 The sequence need not include an open reading frame or specify 
jjr an RNA that would be translatable. It may be preferred for 
1 there to be sufficient homology for the respective anti-sense 

and sense RNA molecules to hybridise. There may be down 
y : regulation of gene expression even where there is about 5%, 
EE 20 10%, 15% or 20% or more mismatch between the sequence used and 
J" the target gene. 

Generally, the transcribed nucleic acid may represent a 
fragment of a gene, or the complement thereof, or may be a 

25 mutant, derivative, variant or allele thereof, in similar 

terms as discussed above in relation to alterations being made 
to a coding sequence and the homology of the altered sequence. 
The homology may be sufficient for the transcribed anti-sense 
RNA to hybridise with nucleic acid within cells of the plant, 

30 though irrespective of whether hybridisation takes place the 
desired effect is down -regulation of gene expression. 

Thus, the present invention also provides a method of 
modifying, affecting, altering or modulating a characteristic 
35 of a plant, e.g. vernalization response, flowering time, leaf 
shape and/or shade avoidance response, the method including 
causing or allowing ant i -sense transcription from heterologous 
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nucleic acid according to the invention within cells of the 
plant . 

The present invention further provides the use of the 
5 nucleotide sequence of VRN2, or a fragment, mutant, 

derivative, allele, variant or hornologue thereof for down- 
regulation of gene expression, particularly down- regulation of 
expression of a VRN2 gene or hornologue thereof, preferably in 

order to influence a physical characteristic of a plant, 
.0 especially vernalization response, flowering time, leaf shape 
and/or shade avoidance response* 

When additional copies of the target gene are inserted in 
sense, that is the same, orientation as the target gene, a 

.5 range of phenotypes is produced which includes individuals 
where over-expression occurs and some where under -express ion 
of protein from the target gene occurs . When the inserted 
gene is only part of the endogenous gene the number of 
under-expressing individuals in the transgenic population 

10 increases. The mechanism by which sense regulation occurs, 
particularly down -regulat ion, is not well -understood. 
However, this technique is also well-reported in scientific 
and patent literature and is used routinely for gene control. 
See, for example, van der Krol et al . , (1990) The Plant Cell 

5 2, 291-229; Napoli et al . , (1990) The Plant Cell 2, 279-289; 

Zhang et al, 1992 The Plant Cell 4, 1575-1588. 

Again, fragments, mutants and so on may be used in similar 
terms as described above for use in anti-sense regulation, 

30 

Thus, the present invention also provides a method of 
influencing a chafacteristic of a plant, e.g. vernalization 
response, flowering time, leaf shape and/or shade avoidance 
response, the method including causing or allowing expression 
35 from nucleic acid according to the invention within cells of 
the plant. This may be used to suppress activity of a product 
with ability to influence vernalization response, flowering 
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time, leaf shape and/or shade avoidance response. Here the 
activity of the product is preferably suppressed as a result 
. of under-expression within the plant cells. 

5 Aspects and embodiments of the present invention will now be 
illustrated, by way of example, with reference to the 
accompanying figures. Further aspects and embodiments will be 
apparent to those skilled in the art. All documents mentioned 
in this text are incorporated herein by reference. 

10 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows total leaf number (Rosette plus Cauline) of Ler 

plants (squares) , fca-1 plants (diamonds) and v r rn2-l fca-1 

15 plants (circles) after various periods of vernalization 
(measured in days) . 

Figure 2 illustrates the Red Far-Red Light plant phenotype: 

Figure 2A shows the area of the largest leaf of Ler, fca- 

0 1 and vrn2-l fca-1 plants grown under white light (W) (open 

bars) or white light with supplementary FR light (W + FR) 
(shaded bars) . 

Figure 2B shows the rosette leaf number at bolting of 
plants treated as in the experiment of which results are shown 
25 in Figure 2A. 

Figure 3 illustrates results of genetic and physical mapping 
of VRN2 in Arabidopsis thaliana. 

30 Figure 4 illustrates cosmids and genes near VRN2 in the 
Arabidopsis genome. 

Figure 5 illustrates the structure of the VRN2 gene of 
Arabidopsis t the 5K cDNA aberrant splicing, and the position 

35 and nature of the vrn2-l mutation. Exons are shown as open 

boxes. Unstranslated regions are shown as shaded boxes. 
Introns are shown as lines. 
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Figure 6 shows the sequence of VRN2 cDNA, including coding 
sequence and predicted amino acid sequence of the encoded 
protein. Putative NLSs are boxed, the putative acidic 
activation domain is underlined. The putative zinc- finger 
5 motif is doubly underlined. The positions of introns are 

indicated with arrows. The position of the vrn2~l mutation is 

circled. 

Figure 7 illustrates dCAPS Marker for the vrn2-l Mutation. A 
10 diagnostic derived CAPS (dCAPS) marker was designated for the 
vm2-l mutation. This utilizes a primer (VRN2-AZ) that 

p includes half of the recognition site for the XmnI restriction 

rr enzyme, the other half is supplied, specifically, by the 

tp sequence of the vrn2~l mutation. This results in a successful 

J?-15 restriction digestion only when using PCR amplified genomic 
PM .DNA from vrn2 - 1 mutants as a template. 

H : Figure 8 shows alignment of the Arahidopsis VRN2 amino acid 
id, sequence with similar proteins. 

5^20 Figure 8A aligns the full-length VRM2 protein with four 

y s other proteins, using 'Clustal method with PAM250 residue 
weight table, performed on 17 January 1999 at 19:19 GMT. 

Figure 8B aligns the zinc finger region, using Clustal 
method with PAM250 residue weight table, performed on 17 
25 January 1999 at 19:25 GMT. 

Abbreviations : 

At Arabidopsis thaliana, Sc Saccharomyces cerevisiae, Sp 
Schizosaccharomyces pombe, Ce Caenorhabditis elegans, Dm 
30 Drosophila melanogaster, Hs Homo sapiens, Mm Mus musculus , Rn 
Rattus norvegicus, Xm Xiphophorus maculatus . 
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SEQ ID NO:\\SEQUENCE 

5 1 Landsberg erecta VRN2 cDNA 

2 Landsberg erecta VRN2 amino acid 

3 Landsberg erecta VRN2 genomic 

4 Columbia VRN2 cDNA 

5 Columbia VRN2 amino acid 
10 6 Columbia VRN2 genomic 

7 5K (Columbia, abberant splice) cDNA 

8 5K (Columbia, abberant splice) amino acid 

9 C72616 EST (modified) cDNA 

10 C72616 EST (modified) amino acid 
15 11 AI163743 EST (modified) cDNA 

12 AI163743 EST (modified) amino acid 

13 At Hyp 2245035 (ATFCA7_4) (modified) cDNA 

14 At Hyp 2245035 (ATFCA7_4) (modified) amino acid 

15 KIAA0160 CDNA 

20 16 KIAA0160 amino acid 

Additional seq uences included in the Figures: ' 



17 Landsberg erecta VRN2 zinc finger amino acid 

25 18 At Dil9 S51478 zinc finger 1 amino acid 

19 At Dil9 S54178 zinc finger 2 amino acid 

20 At SUP U38946 zinc finger amino acid 

21 At Hyp 2191171 zinc finger amino acid 

22 At Hyp 3377806 zinc finger amino acid 
30 23 Sc Pep7 91500 zinc finger amino acid 

24 Sc TFIIIA 730931 zinc finger amino acid 

25 Sp Hyp 1351713 zinc finger amino acid 

26 Ce Hyp 255942 zinc finger amino acid 

27 Ce Hyp 2854197 zinc finger amino acid 
35 28 Ce Hyp 304459 zinc finger amino acid 

29 Dm BRCORE - NS - Z 3 zinc finger amino acid 

30 Dm GAGA 729556 zinc finger amino acid 

31 Dm ken 3550814 zinc finger amino acid 
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32 Hs ATBF-1 976347 zinc finger amino acid 

33 Hs KIAA0160 zinc finger amino acid 

34 Hs ZNF142 3123312 zinc finger amino acid 

35 Mm FOG 2252814 zinc finger amino acid 

5 36 Mm Spalt 1296845 zinc finger amino acid 

37 Rn Roaz 2149792 zinc finger amino acid 

38 Xm ZF1 532083 zinc finger amino acid 

EXAMPLE 1 

10 Characterisation and Cloning of VRN of Arabidopsis Thaliana 
and Mutant Alleles Thereof 

Isolation of vrn2 mutants 

15 Two vrn2 mutant alleles (vm2-l and vrn2-2) were isolated by 
mutagenising fca-1 seeds with EMS as described by Chandler et 
al. (Plant J (1996) 10: 637-644). WO96/38560 (PCT/GB96/01332 ) 
discloses the sequence of fca and mutant alleles and their 
cloning and characterisation. The vrn2-l fca-1 line used here 

20 has been backcrossed- to fca-1 four times. For mapping 

purposes, the vrn2-2 allele was used (at the 2nd backcross) . 

Phenotypic Characterization 
25 Vernalization 

The vernalization response of vrn2 mutant plants was 
investigated by examining their flowering time in response to 
increasing durations of vernalization treatment. 

30 

Standard vernalization conditions were used, i.e. low light 
intensity 5 /xmol m" 2 s" 1 , 8 hr photoperiod, 5±1 degree C, for 
varying periods (from between 1 and 42 days) . Similar effects 
would be observed under continuous or no light, the 
35 temperature is more important. 
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In the absence of a vernalization treatment, vrn2-l fca-1 
mutant plants showed a small but consistent delay in flowering 
compared to the parental (wild-type) fca-1 controls {Figure 1: 
vrn2-l has a higher leaf number than fca-1, a reduced leaf 
5 number correlating with a transition from vegetative to 
reproductive (i.e. flowering) state). However, following 
vernalization, this difference was greatly magnified (Figure 
1) . The response exhibited by vrn2-l fca-1 plants was 
typically a 35% reduction in total leaf number after 6 weeks 
10 of vernalization, compared to 67% reduction or fca-1 controls. 
The delay in flowering, both with and without a vernalization 
treatment, as measured by increased leaf number, was also 
observed if the days to flower i.e. the day at which the first 
floral bud was visible was used. 

15 

Red/Far-Red light perception 

The response to vernalization in Arabidopsis has been 
positively correlated with the response to different ratios of 

20 Red(R) to Far-red (FR) light; mutants and ecotypes that respond 
strongly tend to respond strongly to conditions of low R:FR 
(Bagnall, Ann Bot (1993) 71: 75-83; Martinez-Zapater et al., 
Plant Physiol (1990) 92: 770-776). This response typically 
manifests itself in two distinct ways - an acceleration of 

25 flowering time (leading to an effect that mimics the effect of 
a vernalization treatment) and a reduction in leaf area (or 
shade avoidance) . This response is believed to have evolved 
to allow plants to adapt to the availability of light 
allowing individuals to seek light when in competition with 

3 0 their neighbors. 

We examined the ability of vrn2-l fca-1 mutants to respond to 
conditions that mimic such an environment. 

35 Under these conditions, vrn2-l fca-1 plants showed a marked 

reduction in the shade avoidance response, with the mean area 
of the largest leaf decreasing by only 26%, compared to 74% 
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for the fca-l control (Figure 2A) . However, the vrn2-l 

mutation does not appear to affect all aspects of the response 
to FR light, as vrn2~l fca-1 plants showed a similar 

acceleration of flowering in response to supplementary FR 
5 light as fca-1 controls (Figure 2B) . 

These data provide indication that VRN2 plays a role in 
regulating the response to FR light, and may mediate changes 
specifically in leaf size {as flowering time is only slightly 
10 affected) under conditions of low R: FR ratios. 

Genetic Mapping 

The VRN2 gene was mapped in an F2 population derived from the 
15 cross vrn2-l fca~l crossed to fca-10 f following the procedure 

used to map the VRN1 gene (Chandler et al . supra.) . The VRN2 

gene was initially positioned using a population of 70 F2 
individuals between the RFLP markers gl3683 and mill2 (Schmidt 
et al., Plant J (1996) 9: 755-765) on the long arm of 

20 Chromosome IV (or D) , using conventional techniques. The map 
position of VRN2 was further refined by screening an 
additional 429 F2 plants with a SSLP derived from the marker 
gl9247 (Schmidt et al./ (1996) Plant J 9: 755-765) and the 

CAPS~marker g4539 (Parker et al., Plant Cell (1998) 9: 1-17). 

25 A total of 12 individual F2 plants that were recombinant 
between gl9247 and g4539 were further analyzed with RFLP 
markers gl3683 and CC36F6 (Bancroft et al • , Weeds World 

4ii:(l997)) and the CAPS marker C18 (Parket et al supra.) , 
and with two additional CAPS markers (VRN2RS, VRN2CD) 
30 generated using the published Columbia sequence (Bevan et al . , 

Nature (1998) 391: 485-488) 

(accession numbers Z97341 and Z97342) as a template. The VRN2 
gene was localized to a 24 5 kb region defined at the 
centromeric (north) end by an RFLP detected with the cosmid 
35 CC36F6, and at the telomeric (south) end by the g4539 CAPS 

marker. This interval is defined by 3 recombinant individual 
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F2 plants; 1 recombinant between VRN2 and CC36F6, and 2 
recombinants between g453 9 and VRN2 (Figure 3) . 

Physical Mapping 

5 

The genetic interval defined by CC36F6 and g453 9 is almost 
completely covered by the 3 BAC clones T1C7, I5D3 and T5015 . 
Cosmids derived from subclones of YAC EW16B10 in the binary 
vector 04541 (Bancroft et al . , supra.) (abbreviated as IB) 

10 were positioned by end sequencing, and ordered relative to the 
published sequence of the Columbia ecotype in this region 
(Genbank accession numbers Z97341 and Z97342) . We selected 
cosmids clones that extended from the complex RPP5 locus 
outwards to CC36F6 and g4539, reasoning that VRN2 was not 

15 within the RPP5 locus, which is comprised of multiple repeats 
of RPP5-like genes in both Columbia and Landsberg ecotype s 
(Bevan et al . , supra.) . 

Additional Landsberg cosmids in the 04541 binary vector 
20 covering the region not covered by the Columbia YAC subclone 
cosmids were identified by hybridization to the inserts from 
BACs T1C7 and T5015, and aligned based on end sequencing, and 
compared to the published Columbia sequence (Bevan et al., 
supra.) and to the sequence of the Landsberg ecotype in this 

25 region. An almost complete cosmid contig was generated over 
this region. 

Simultaneously with the isolation of cosmids, ordered cosmids, 
beginning with those at the centromeric end of the contig, 

30 were transformed into vrn2-2 fca-1 plants by Agrobacterium 

tumef a ci ens-mediated vacuum infiltration (Bechtold et al., C R 
Acad Sci Paris (1993T 316: 1194-1199). (Figure 3). The 
presence of the cosmid in each transgenic line (Tl plants) was 
confirmed by a cosmid- specif ic diagnostic PCR, comprising an 

35 insert specific primer (corresponding to a portion of the 
Columbia genomic DNA) and a primer present in the cosmid 
vector. 
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Cosmids introduced into vrn2-l fca-1 plants were tested for 
their ability to complement the vrn2 phenotype. T2 seeds, 

from individual Tl plants segregating kanamycin resistance at 
a 3:1 ratio, were sown on soil and vernalized for two (in some 
experiments three} weeks. Plants were then transferred to 
greenhouse conditions, and after ten days pricked out into 
individual compartments of divided trays . Total leaf number 
was determined, and cosmids were scored as complementing if 
the segregation ratio of early to late plants (when compared 
to fca-I and vrn2-2 fca-1 controls) plants was approximately 
3:1. 

Two Columbia cosmids (4A23, 2 out of 2 Tls; 6N1, 1 out of 1 
Tl) clearly complemented the phenotype of vrn2-l fca-1 

mutants, with the earliest plants flowering at approximately 
the same time as vernalized plants* 

Sequence Analysis and ORF Prediction 

The sequence in common to both IB4A23 and IB6N1 cosmids has 
previously been annotated as containing 2 complete predicted 
genes (ATDL4445W and ATDL4450W) and (presumably 
non- functional) portions of two other genes - the 3' end of 
ATDL4440W and the 3 1 end of the RPP5-like gene, CHPR 
(ATDL4460W) (Bevan et al . , supra.) Genbank accession number 

Z97342. In addition, a cognate cDNA (5K) not included in the 
annotation is present in this region, and appears to span two 
of the predicted genes (ATDL444 5W and ATDL4450W) (Bevan et 

al w supra.). However, as two cosmids in the region (1 

independent Tl line from each of cosmids IB4N6 and IB6C5) did 
not complement the mutant phenotype (Figure 4) this ruled out 
the predicted gene ATDL444 5W. This left the unannotated cDNA 
5K, and the predicted gene ATDL4450W as candidates for VRN2 . 
However, the presence the cognate cDNA 5K from the Columbia 
ecotype that overlapped both ATDL4445W and ATDL4450W 
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necessitated a re -examination of the prediction for the 
ATDL4450W gene. 

In order to define the structure of these genes, we used the 
5 NetGene2 prediction program (Hebsgaard et al . , Nucl Acids Res 
(1998) 24: 3439-3452), using "Arabidopsis" as the organism 
option (the only parameter that can be set manually) . BLAST, 
PSI-BLAST, PSORT and PROSITE programs were used to identify 
potential function domains and similarities (Altschul et al . , 

10 Nucleic Acids Res (1997) 25: 3389-3402; Bairoch et al., Nucl 
Acids Res (1997) 25: 217-221; Nakai et al . , Genomics (1992) 
14: 897-911). Default parameters of TBLASTN, PSI-BLAST and 
BLASTP were used (Expect = 10, BLOSDM6 2 matrix, gap penalty = 
11, penalty per gap 1, lambda ratio 0.85). The NCBI/GenBank 

15 database was used. The PSORT algorithm (Nakai) was used, 
using the "plant" option as the source organism (the only 
parameter that can be manually changed) . The Profile Scan 
program at PROSITE (Bairoch) was used to search for motifs in 
VRN2, with default parameters (there are no parameters a user 

20 can select, the results being "hit" or "no hit"). 

This analysis yielded predictions for two genes, 5K, a 
putative nuclear localized protein that is highly 
post- transcriptionally spliced (15 exons) , represented by the 
25 Columbia cognate cDNA; and a modified prediction for 4450, 

with 6 putative membrane- spanning domains, represented at its 
3' end by an Arabidopsis EST (accession number T22412) . 

Determinat ion of the vrn2-2 Mutation and Identification of the 
30 VRN2 gene 

In an attempt to determine which gene (5K or 4450) is VRN2, 
PCR primers were designed to amplify products encompassing the 
entire predicted open reading frame of both genes. 

35 

Three independent RT-PCR reactions using total RNA prepared 
from fca-l,vm2-l fca-1 and vrn2-2 fca-1 14 day-old seedlings 
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grown on GM plates in continuous light were performed for each 
predicted gene with a high fidelity enzyme mix (Boehringer 
Mannheim, HiFi System) . These PCR products were sequenced 
using both the primers used for PCR, and a series of internal 
primers, using the BIGDYE kit (PE Applied Biosystems) . The 
reactions were run on an ABI377 machine, and compiled using 
the SeqMan (DNAStar, Lasergene) program. 

The sequences of the PCR confirmed our prediction for both 
genes, and indicated that we had amplified across the entire 
open reading frame of 5K and ATDL4450W as anticipated. 

Several minor polymorphic differences were detected between 
the published Columbia sequence, and the Landsberg erecta 
sequence we amplified by PCR. These differences were 
consistent with the Landsberg erecta genomic sequence in this 
region. Furthermore, the Columbia cDNA for 5K appears to 
utilise a different splice donor site from that used in the 
Landsberg ecotype, and would produce a truncated, presumably 
non- functional protein (Figure 5) . However, we have also 
sequenced the Columbia 5K product derived independently by 
RT-PCR, and this appears to use the same splice site as 
Landsberg, and should encode a functional protein. A 
consistent difference between the vrn2 mutants and fca-l was 

detected in the 5K PCR product, a G to A change at position 
1201 of the predicted cDNA in vrn2-l fca~l (Figure 5) . We are 

currently investigating the nature of the mutation in the 
vra2-2 allele. This type of mutation, a single base-pair 

change, is commonly observed following EMS mutagenesis. This 
mutation converts a TGG codon (Tryptophan) to stop codon 
(TGA) , and would result in the production of a truncated 
protein of 322 amino acids in the vrn2-l mutant, compared to 

44 3 amino acids of wild type VRN2 (Figure 6) . The presence 
of this mutation indicated that 5K was likely to be VRN2 . The 
presence of the vrn2-l mutation in the genome of vrn2-l fca-1 

mutant plants was confirmed by a derived CAPS (dCAPS) 
(Michaels et al . , Plant J (1998) 14: 381-385; Neff et al . , 
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Plant J (1998) 14: 387-392) marker specific for the vrn2-l 
mutation (Figure 7) . This diagnostic test is specific for the 
vrn2-l mutation, as it detects wild type VRN2 in both fca-1 
and vrn2-2 fca.-l mutants. 

5 

Analysis of the VRN2 gene 

To gain an insight into the possible function of the VRN2 
gene, and how the vm2-l mutation may affect the function of 
10 the VRN2 protein, we compared the amino acid sequence of VRN2 
to several databases of protein and translated nucleic acid 
sequences using BLASTP and TBLASTN programs at NCBI, using the 
default parameters as noted above. 

15 Several molecules with a significant degree of similarity were 
identified (Table 1 and Table 2) . 

One such gene, represented by a human cDNA (KIAA0160) shares 
homology with VRN2 over a short region near the amino terminus 

20 of VRN2 (amino acids 63 to 132), and a longer, but less 

conserved region of homology towards the carboxy terminus 
(amino acids 263-366) (Figure 8a) . Closer examination of the 
amino terminal conserved region revealed that it matches the 
consensus of a zinc-finger motif. Such motifs can take a 

25 variety of forms, but all co-ordinate zinc atoms through two 
cysteine residues, and two cysteine or histidine residues. 
VRN2 falls into the latter class, having a C2H2 motif 
comprised of two cysteines separated by 2 amino acids, and two 
histidines separated by two amino acids (Mackay et al w TIBS 

30 (1998) 23: 1-4) . 

Zinc -finger motifs are known to be capable of mediating both 
protein-DNA and protein-protein interactions. The zinc finger 
motif of VRN2 does not closely resemble the large EPF family 
35 of Arabidopsis C2H2 zinc fingers from Arabidopsis , which have 
a highly conserved QALGG motif in the middle of the zinc 
finger (Kubo et al w Nucl Acids Res (1998) 26: 608-615) (Figure 
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8b) . In addition, VRN2 differs from the EPF proteins in that 
VRN2 has a single zinc finger motif, whereas most members of 
the EPF family (with the exception of SUP and AtZFPl) have 
between two and four zinc fingers (Kubo et al., supra.). This 
5 amino- terminal region (amino acids 63-132), and particularly 
the zinc- finger motif (amino acids 90-111) may thus represent 
a domain that mediates protein-protein or protein-DNA 
interactions . 

10 The carboxy terminus of VRN2 • (amino acids 263 to 366) is 

similar to several other candidate genes (Table 1 and Table 
2) . As mentioned above, there is limited homology to the 
human predicted protein KIAA0160. The molecule showing 
greatest homology to VRN2 is an EST sequence from poplar 
15 (Populus tremula L. x Populus tremuloides Michx (Accession 
Number AI163743) (Sterky et al., PNAS USA (1998) 95: 
13330-13335) which has 52.8% identity over 127 amino acids 
(Table 1) , as calculated with the BLASTP algorithm using 
default parameters (as noted above) . VRN2 also shows 
20 significant similarity to a predicted Arabidopsis protein 

(ATFCA7_4, Accession Number 2245035) (Bevan et al., supra.), 
which is quite close to VRN2 on chromosome 4, only 30 kb away 
towards the centromere. A close examination of the sequence 
near this gene revealed that the prediction as annotated may 
25 be incorrect, as the use of a different splice site, resulting 
a different carboxy terminus to the protein, increases the 
degree of homology with VRN2 . The similarity of these two 
Arabidopsis genes raises the possibility that VRN2 may be a 
member of a gene family in Arabidopsis, and their close 
30 position suggests that these genes may have arisen following a 
duplication. Arguing against this notion is the observation 
that these two genes, VRN2 and ATFCA7_4 , are transcribed in 
opposite directions. A rice EST (C72616) also shares 
significant similarity with the carboxyl region of VRN2 , 
suggesting that this region may form an evolutionarily 
conserved domain present in monocots and dicots. 



35 



SUBSTITUTE SHEET (RULE 26) 



WO 00/44918 PCT/GBOO/00248 

55 

This conserved carboxy region is predicted to be highly 
charged, as it is composed of a large number of acidic 
residues (D and E) . This highly charged region is highly 
similar in the Poplar and rice ESTs {Table 2) . Such acidic 
5 regions are found in a number of eukaryotic transcription 

factors, and often function as activation domains (Hahn, Cell 
(1993) 72: 481-483). It is therefore possible that VRN2 may 
function as a transcription factor, given it has both a 
DNA-binding motif (or protein binding) (amino acids 63-132) 

10 and a putative activation domain (amino acids 263-328) . 

Furthermore, the amino portion of VRN2 contains two predicted 
nuclear localization signals (NLSs) (Figure 6) . The first is a 
simple 4 residue basic signal, while the second is a 
bi-partite signal, that fits the consensus (R/K) (R/K) N10 (R/K) 4 

15 (Dingwall et al . , TIBS (1991) 16: 478-481). 

EXAMPLE 2 

Production and Characterisation of Arabidopsis Transgenic for 
VRN2 

20 

VRN2 cDNA in the sense orientation is cloned into plant 
expression vectors SLJ4D4 and SLJ4K1 (Jones et al. , (1992) 
Transg. Res. 1, pp 285-297) according to the teaching of 
Sambrook J et al (1989) Molecular Cloning: A Laboratory 
25 Manual, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY. SLJ4D4 and SLJ4K1 place the VRN2 cDNA under the 
control of the Ca MV 35S promoter and include terminator 
sequences from octopine synthase (ocs) and nopaline synthase 
(nos) genes, respectively. 

30 

Antisense constructs are produced in the same manner except 
that the VRN2 cDNA is inserted into the expression vectors in 
the opposite orientation . 0 

35 The VRN2 expression cassettes are then subcloned separately 
into the binary vector SLJ1714 (Jones et al . , supra), and 
mobilised into Agrobacterium strains by tri -parental mating 
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according to the teaching of Hoekema et a2 , , (1983) Nature 
303, pp 179-180. Arabidopsis are transformed with the 
Agrobacterium strains carrying VRN2 expression constructs 
(either sense or antisense) following the teaching of Bechtold 
5 et al., (1993) C R Acad, Sci. Paris 316, pp 1194-1199. 

Arabidopsis plants are assayed for changes in their response 
to changes in the ratio of far red light, essentially as 
described by Halliday et al . , (1997) Plant J. 12, pp 1079- 
10 1090. 

Resul ts 

Differences in vernalisation requirement and response are 
15 observed in Arabidopsis plants transgenic for VRN2 (sense or 
antisense orientation) relative to Arabidopsis plants 
transformed with empty vectors and non- trans formed Arabidopsis 
plants . 

20 EXAMPLE 3 

Production and Characterisation of Tobacco Transgenic for VRN2 

VRN2"cDNA in the sense orientation is cloned into plant 
expression vectors SLJ4D4 and SLJ4K1 as described in Example 
25 2. 

Antisense constructs are produced in the same manner except 
that the VRN2 cDNA is inserted into the expression vectors in 
the opposite orientation. 

30 

VRN2 expression cassettes subcloned separately into the binary 
vector SLJ1714 (Jones et al supra) , are mobilised into 
Agrobacterium strains by tri-parental mating as in Example 2, 
and tobacco plants are transformed using the Agrobacterium 
35 strains carrying VRN2 expression constructs {either sense or 
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antisense) following the teaching of Horsch et al . , (1985) 
Science 227, pp 1229-1231. 

Tobacco plants are assayed for changes in their response to 
5 changes in the ratio of far red light, essentially as 

described by Halliday et al. f (1997) Plant J. 12, pp 1079- 
1090. 

Resul ts 

10 

Differences in vernalisation requirement and response are 
observed in Tobacco plants transgenic for VRN2 (sense or 
antisense orientation) relative to Tobacco plants transformed 
with empty vectors and non-transformed Tobacco plants. 

15 

EXAMPLE 4 

Production and Characterisation of Brassica (Oil Seed Rape, 
Winter Type) Transgenic for VRN2 

20 Sense and antisense constructs are used to generate 

Agrobacterium strains carrying VRN2 expression constructs 
(either sense or antisense) as described in Example 2 and 
Example 3, and are used to transform oil seed rape following 
the teaching of Moloney et al . , (1989) Plant Cell Rep. 8, pp 

25 238-242. 

-Results 

Differences in vernalisation requirement and response are 
30 observed in oil seed rape plants transgenic for VRN2 (sense or 
antisense orientation) relative to oil seed rape plants 
transformed with empty vectors and non- transformed oil seed 
rape plants. 



35 



WO 00/44918 PCT/GB00/00248 

58 

EXAMPLE 5 

Production and Characterisation of Rice Transgenic for VRN2 

VRN2 cDNA in the sense or antisense orientation is cloned into 
constructs and used to generate respective Agrrobacteriuxn 

strains. The Agrobacteri urn strains carrying VRN2 expression 

constructs (either sense or antisense) are used to transform 
rice following the teaching of Kohll A et al (1998) Proc . 
Natl. Acad. Sci. USA 95, pp 7203-7208. 

Results 

Differences in vernalisation requirement and response are 
observed in rice plants transgenic for VRN2 (sense or 
antisense orientation) relative to rice plants transformed 
with empty vectors and non- trans formed rice plants. 

EXAMPLE 6 

Production and Characterisation of Wheat Transgenic for VRN2 

Agrobacterium strains carrying VRN2 expression constructs 
(either sense or antisense) are generated as in preceding 
Examples and are used to transform wheat following the 
teaching of Becker D et al., (1994) Plant J. 5, pp 299-307. 

Results 

Differences in vernalisation requirement and response are 
observed in wheat plants transgenic for VRN2 (sense or 
antisense orientation) relative to wheat plants transformed 
with empty vectors and non- transformed wheat plants. 

METHODS AND MATERIALS 
Plant Growth 

For vernalization treatments, seeds were sown on a damp layer 
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of fine grit (Levington 1 s M3) on wet soil in individual pots, 
and vernalized for increasing durations at 4°C, 8hr light :16hr 
dark, 5 /xmol irf 2 sec" 1 light intensity. Seed sowing was 
staggered, with all plants removed from the vernalization 
5 -conditions simultaneously. Following vernalization, plants 
were placed into a controlled environment chamber 
(Gallenkamp) , 20°C, 16 hr light: 8hr dark 90 /xmol rrf 2 sec" 1 
light intensity. Plants receiving no vernalization treatment 
were stratified for 2 days under vernalization conditions, and 

10 grown for two days prior to transfer into the growth cabinet. 
Plants were grown for 10 days, and then pricked out into 
individual compartments of P40 trays. The flowering time, as 
measured by counting total leaf number (i.e. rosette and 
cauline leaves) was determined once the primary inflorescence 

15 had elongated sufficiently. 

The phenotype of vrn2-l fca-1 plants was examined under 

different ratios of red to far-red (R:FR) light. Plants were 
stratified for 2 days, then grown for 10 days under continuous 

20 light, and then transferred to separate growth chambers with 
white light (W) , R:FR ratio of 5.8, with or without 
supplementary FR light (W+FR) , R:FR ratio 0.08. The number of 
rosette leaves was determined to measure flowering time, and 
the area of the largest leaf was used as an indicator of the 

25 shade -avoidance response. 

Mapping 

VRN2 was positioned on the long arm of Chromosome 4 (or D) 
through linkage to the RFLP marker m506 (Chang et al . , Proc 

30 Natl Acad Sci USA (1988) 85: 6856-6860), in progeny of a cross 
between vrn2-l fca-1 (Ler background) and fca-10 (Ws 

background). Further RFLP markers in this region (Liu et al . , 

Plant J (1996) 10:733-736) were used to refine the position of 
VRJSI2 . We used standard RFLP techniques, using 32P~labeled 
3 5 cosmid probes, and a Phosphorlmager detection system. A 
Ler:Ws RFLP was detected when using EcoRI digested genomic 
DNA, and either gl3683 or CC36F6 as probes. The gl9247 marker 
is a SSLP marker (using the primers gl9247F and gl9247R, see. 
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below) , with Ler producing a PCR- amplified band of 
approximately 750 bp, while Ws produces a band of 862 bp. 
Fine mapping of VRN2 was performed using a series of 
PCR-derived markers, based on the Columbia genomic sequence in 
5 this region (Bevan et al . , supra.). The VRN2CD marker was 

amplified with two primers, VRN2-C and VRN2-D, and digested 
with Ddel. This produces a CAPS marker, with Ler producing 
bands of (approximately) 480 bp, 290 bp and 190 bp, while Ws 
produces bands of (approximately) 33 0 bp, 290 bp and 150 bp. 
10 The VRN2RS marker is a dominant marker for Ws (i.e. Ler:Ws 
heterozygotes cannot be distinguished from Ws) , produced by 
amplifying genomic DNA with VRN2-R and VRN2-S (see below) , and 
digesting the product with MboII. This produces a single 

predominant band (and several much smaller unresolved bands) 
15 in Ler of approximately 400 bp, while Ws produces two bands, 
400 bp and 300 bp. 

Cosmid I sol at ion 

Cosmids covering the region not covered by the subclones of 
20 EW16B10 YAC were identified through hybridization to BAG 
inserts derived fronuBACs T5015 and T1C7. BAC DNA was 
purified and the insert isolated following digestion with NotI 

and separation by pulsed field gel electrophoresis (PFGE) as 
described in (Bancroft et al . , supra.) . Purified BAC inserts 

25 were labeled with a32P-dCTP by random primed labeling, and 
hybridized to arrayed grids of a Ler genomic cosmid library. 
Positively hybridizing cosmids were identified, and rescreened 
following restriction enzyme digestion, southern blotting and 
hybridized to the BAC insert probe used initially, Cosmids in 

30 the region of interest were selected, and genomic sequence was 
obtained from the ends of the insert using the BIGDYE cycle 
sequencing kit, and T3 and T7 primers, whose sequences flank 
the genomic DNA insert site. This sequence was aligned with 
that of the Columbia genomic sequence to accurately position 

35 the cosmids. 
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Comp 1 ement at ion 

Cosmids in the 04541 binary vector were mobilized into 
Agrobacterium tumefaciens (strain C58C1 RifR by tri-parental 
mating (Hoekema et al . , Nature (1983) 303: 179-180). vrn2-l 
5 fca-1 plants were transformed with these Agrobacterium strains 
by vacuum infiltration (Bechtold et al, supra.). Transgenic 
Tl plants were selected on GM with Kanamycin (50 mg/mL) , and 
transferred to soil when they had reached the 3-4 leaf stage. 
The presence of each cosmid in the transgenic lines was 
10 confirmed using either a Ler/Col specific polymorphism (CAPS 
or SSLP marker) or more commonly, through the use of a 
specific diagnostic PCR reaction, using a primer present 
within the cosmid insert sequence and a primer present in the 
cosmid flanking the insert site. Transgenic plants were also 
15 tested for the presence of the fca-1 mutation , as the TO 
plants should carry this mutation. Together, these two 
methods ensured that only vrn2-l fca-1 plants carrying the 
desired cosmids were analyzed further. We aimed to produced 5 
to 6 independent Tl transf ormants for each cosmid, but for 
20 some cosmids only a single line was produced. However, as 
complementation was observed before we had generated Tls for 
all of the cosmids, we are continuing to produce additional 
transgenic Tl plants carrying the cosmids near VRN2 . Tl 
plants were grown in a controlled environment chamber 
25 (conditions as above), and allowed to self. T2 seed were 
collected, and analyzed for the segregation of Kanamycin 
resistance or sensitivity on GM plates containing Kanamycin 
(as above) f scored 14-20 days after germination. Progeny from 
Tl plants that segregated a 3:1 ratio of resistant to 
30..,- sensitive plants were tested for their ability to complement 
the vrn2-l mutant phenotype, by vernalizing for 3 weeks and 
recording the total leaf number . 

Sequencing QRFs 

35 Two potential open reading frames were sequenced following 

RT-PCR using RJsfA isolated from fca-1 and vm2-l fca-1 plants. 
cDNAs were reverse transcribed, primed by a dT12-18 primer, 
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and specific primers were then used to PCR amplify regions 
corresponding to the ORFs for ATDL4450W and 5K (VRN2) . The 
Boehringer Mannheim HiFi PCR system was used to increase the 
fidelity of amplification- The primers VRN2-AL and VRN2-AM 
were used to prime ATDL4450W, and VRN2-A1 and VRN2-AJ for 5K. 
For 5K # the PCR reaction for the full length product 
(VRN2AI -VRN2AJ) was inefficient, so subsequent v reactions were 
performed to amplify the cDNA in two overlapping fragment, 
using the primer combinations VRN2-AI with VRN2-AS; and 
VRN2-AO with VRN2-AJ * The PCR products were isolated and 
purified^ and sequenced directly using the BIGDYE sequencing 
kit tPE Applied Biosystems) . At least two independent PCR 
products were sequenced from each allele. We sequenced the 
ATDL4450W PCR products with the amplifying primers VRN2-AL 
VRN2-AM, and VRN2-AU through VRN2-AX* The 5K cDNA was 
sequenced with VRN2-A1, VRN2-AJ and VRN2-AO through VRN2 -AT . 
Sequences were aligned into contigs using the DNAStar software 
package (LaserGene) . 

Sequence Comparisons 

EST and cDNA sequences were first translated using the MapDraw 
program of DNAStar (LaserGene) . This revealed several regions 
where the sequence appeared to be incorrect, as small changes 
to the described sequences drastically improved the similarity 
to VRN2. Modified nucleic and amino acid sequences were 
initially aligned using programs within the MegAlign Package 
of DNAStar (LaserGene), as follows: 

Amino acid sequences were initially aligned using the Clustal 
V method (Higgins and Sharp (1989) CABIOS vol. 5, no. 2, 151- 
153) . Default parameters were used: Gap penalty 10, Gap 
length penalty 10, ktuple 2, and the PAM 250 residue weight 
table. Alignment was refined by eye and hand, making minor 
adjustments to position of gaps to improve alignment. 

Nucleic acid (EST and cDNA) sequences were initially aligned 
with the Hein method (Hein (1990) Methods in Enzymology, vol. 
183, 626-645) , using default parameters: Gap penalty 11, Gap 
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length penalty 3, ktuple 2, and the weighted residue weight 
table. Where the nucleotide alignment {when translated) did 
not correspond to the amino acid alignment, the positions of 
gaps were adjusted to correspond with the gaps in the amino 
acid alignments, 

dCAPS Marker for the vrn2 - 1 Mutation 

A derived CAPS {dCAPS ) marker was designed that was specific 
for the vrn2-2 mutation. Following PCR amplification from 

genomic DNA with the primers VRN2-AY and VRN2-AZ, the 170 bp 
product was digested overnight with XmnI restriction enzyme, 
and the products resolved on a 4% agarose gel. Wild type 
plants (VRN2) produce a single band of 170 bp following 
digestion, while vrn2-2 mutants produce two bands, of 137 bp 
and 3 3 bp. 

Primers Used to Identify VRN2 

As indicated on Figure 3 and Figure 5. All primer sequences 
are indicated 5 1 to 3 1 . 



gl9247F 


ACT 


GTT 


CGT 


CTC 


CTT 


CAT 


CAT 


G 




gl9247R 


TTG 


CTT 


GCC 


TGA 


AAA 


AAG 


TAT 


G 




VRN2-C 


TGT 


CGA 


TAT 


GCG 


ACC 


AGT 


ACC 






VRN2-D 


CAG 


GCT 


TAG 


ACC 


CAA 


TTG 


ACC 






VEN2-R 


AGG 


TAG 


GAT 


CCG 


ACA 


TCG 


TCT 


TCT 


TAT 


VRN2-S 


CTC 


TTG 


AAT 


TCA 


AAA 


CTA 


TTC 


CTA 


CTC 


VRN2-AI 


GCC 


AAT 


CGG 


TGT 


TTT 


CGC 


AGC 


TTT 


C 


VRN2-AJ 


AAG 


AAT 


AAG 


TTA 


CAA 


TCC 


GAT 


AAA 


TCG 


VRN2-AL 


CAG 


TGG 


TTG 


AAG 


CTT 


AAG 


GAG 


G 




VRN2-AM 


GCA 


ATG 


AAT 


AAA 


TCA 


TAA 


TCT 


TGG 




VRN2-AO 


TCT 


ACT 


GGG 


ATG 


GTA 


GTT 


TTC 






VRN2-AP 


ATA 


TCC 


CGA 


GGC 


AAC 


AGA 


GCT 


TG 




VRN2-AQ 


CAT 


CTT 


TGG 


AAC 


TCG 


TTT 


G 






VRN2-AR 


CTC 


AGT 


TGT 


AAT 


AGT 


TGC 


CC 






VRN2-AS 


AAG 


AGT 


GGG 


CTA 


TGG 


CTG 


G 






VRN2-AT 


GCA 


ACT 


CTT 


TCT 


CGT 


AAA 


ATC 


TTG 




VRN2-AU 


GCC 


TCC 


ATA 


ACT 


GTC 


ATC 


ACA 


TC 




VRN2-AV 


TTT 


CAT 


TGG 


TCA 


TGG 


GAT 


GG 







WO 00/44918 PCT/GB00/00248 

64 

VRN2-AW GAC TTC AGA GAT GGG TTT ATG C 

VRN2-AX TCC ATA TCT AGC TCC TTC GCC 

VRN2-AY TGC GTT CAT TAA GTA GGC AAC AGA AAA TGG 

VRN2-AZ GAG AAG TAG TTA CCT TTG TTT TCT TAC AGA AGA GT 
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CAAGCTTCTTCAATTTTGCTTGCTCTCTCTTACACAGCCAATCGGTGTTTTCGCAGCTTTCA 
GGCCTCAATCCAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGCAT 
TGAGTTTATCGCTATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACTAA 
CTTCATAATCGGCTCTTGACGTTGTTGAGTGTAATTGAACAAGAATGTGTAGGCAGAATTGT 
CGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGTAA 
ACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACCCATCGTTTCTTCCAA 
GATGCTTGAACTACAAAATTGGAGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGATGGTA 
GTTTTCAACTATAAGGATTGTAATAACACATTACAGAAAACTGAAGTTAGGGAGGATTGTTC 
TTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGGGCTGCAATTTCATTTGAATTCAT 
CTCATGATTTATTTGAATTTGAGTTCAAGCTTTTCGAAGAATACCAGACAGTTAATGTTTCT 
GTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAGTGATGACGATAAATTTGAGCCCTT 
CTCTCTCTGCTCGAAACCTCGTAAGCGGAGACAAAGAGGTGGCAGAAATAACACCAGGAGAC 
TTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTAACTAATGGCACAGAAAATGGAATC 
ACCCTACTTAATGATGGAAACCGTGGTTTAGGATATCCCGAGGCAACAGAGCTTGCTGGACA 
ATTTGAGATGACCAGCAACATTCCACCAGCCATAGCCCACTCTTCTCTGGACGCTGGTGCTA 
AAGTTATATTGACAAGCGAAGCTGTGGTCCCTGCTACTAAGACAAGAAAGTTATCTGCTGAG 
CGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACGCCAATTCTATCATTCTCACAGAGT 
CCAGCCAATGGCGCTTGAGCAAGTAATGTCTGACCGGGATAGCGAGGATGAAGTCGATGACG 
ATGTTGCAGATTTTGAAGAT-CGCCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATGAA 
AAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGGTTATAGCAGATGGTCA 
TATCTCTTGGGCATGTGAAGCATTTTCAAGATTTTACGAGAAAGAGTTGCACCGTTACTCAT 
CACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCATGGACTTGTCGACTCA 
GCC^CCATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATAGCTCAGACACCACCAC 
CACCAACAACAACAACAGTGTGGATCGTCCCAGTGACTCAAACACCAACAACAATAACATTG 
TGGATCATCCCAATGACATAAACAACAAGAACAATGTTGACAACAAGGACAATAACAGCAGA 
GACAAAGTAATTAAATAGGAAAATCTCCGGCTTTTATGATACCGATTTATCGGATTGTAACT 
TATTCTTCTTTCTTAAAAAATTGTTTAGGAGCAAAGAAATTTTTTATATGTTAGTGTATTCA 
ACTGATTACATTTTTAGTTAAAAAAAAAAATGGATTCTGCTTATAACT 



SEQ ID NO: 2 

MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKRKS 
RSTGMWFNYKDCNNTLQKTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKLFEEY 
QTVNVSVKLNSFIFEEEGSDDDKFEPFSLCSKPRKRRQRGGRNNTRRLKVCFLPLDSPSLTN 
GTENGITLLNDGNRGLGYPEATELAGQFEMTSNIPPAIAHSSLDAGAKVILTSEAWPATKT 
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RKLSAERSE^SHLLLQKRQ 

DVNKDEKQFMHLWNSFVRKQRVIADGHISWACEAFSRFYEKELHRYSSLFWCWRLFLIKLWN 
HGLVD SAT I NHCNT I LENCRNS SDTTTTN1SINNS VDRP S DSNTNNNNI VDHPND 1 3STNKNNVDN 
KDNNSRDKVIK 



SEQ ID NO: 3 

AAAGAGAATGCTTTGACTCTCTCATTGGTCAAACCTGACTGTATTTATATGCGTTATTGTGT 
1 0 GGTAAAGTTTCGACCTTTGACTTTACAAGTTGGCGTTAAGAAGAGAGATGCGTAGATCAGCG 
AGTGGTTCGAGAGTTTTGGATCATTTTCCCCCGACTTCACGGTCTCCACGTCGATCTCAGAG 
CATT^ATCATTGGAAGATGATGTGGAGGTGCTTTTGCCTAGGTACGATCCGAATTCTCAAG 
CGGGGAAGAGAGAGAAGTCAAGATTCAGATTTGCAGAAAACGTCATCCATTTGATTCCTCTC 
ATTCTTCTTCTCTGTATCGCAATCCTCTGGCTCTCCTCTTATTCAGGTAAGCCGAGAAATTG 
1 5 ATTCAATCTCTATGAATCCATAATTGATATGTGAAACTTAATTAGGGATTTTACAAAGGCT C 
ATATGGATATGATATGAGGATCGAGATGTCTCTGTAACATTAGAATCTTGTGTTGAATTATT 
GTTTCAATTTGTTCATATTATACTAAACCGGTGATGGATTTGGAATTTGTCAGCAGCGTTAA 
GGAGTTGAGTTCAAGAAGCAACATGTTGTCTTGTCTCCATGGGAACTCATCATATTCAGTTT 
TGGGAAAGGAAACAATTTTTTTTACCGCCGGTGATTATGTGCCGCAAACCATACGTAACTTT 
20 TGTAATTTTCGGTTCTGTAGACACATAAAAGGATCTCTCGTTTTCATGAAATGTATGTTTAA 
TATTTCACTATACATCACACAACTCAAGTAGAAAACACTGATGGTTATCCATTAATCATCAT 
TCTATTGGTCGAAAACAAGGATTAGTTTCAACTTATTGCTACCTTAGTGATTAGATGTTCCT 
GTGAGTTTCAGCTAGCCAAGTCAACTAGAGTTAAACAATGGAATCAAAATACATATTCAGTA 
ATTTATTTTAAACTCTGACTATTTATGTAAACAAAAATGGAAATTAAAATTGAAGGTCATGA 

2 5 AGATTCTATTCTTAGTATGAAAAGTATAGATCAATGATAAAAGTATATACC AGAACAGTGGT 

GGATCTAGAAACATATTTAGTATATGGCACAATATATTTAACATATACAAATTTTAATCTAA 
AAGTTCTATTCATTTATGAAAAGACWTCTGAATGAAGCAAATTTATTTGATGTGTTAATCAT 
CCATTTATGTGTTAATCAGCCATTGATGTTAGTATAGTACTCTATGCTAACATAATTTTTTT 
ATACTATAAATTAAAAAAATAGGTAAGAAAAGAAAAATAGATTAATATAAAAAGCATTTTAT 

3 0 TAGCTGAAATAAATAAAATGAAAGAAGATAATAACTAATTGACTAAAAAATTAGTAGAGCAT 

ATGGGGCACAATACACTAAGTATTTCATCTTTACTATAAAATGTAACAAATTTCAAAATTAT 
CAAACTGTATATAGGGCACGTGCCTAGGTACCAATAGACGTACGTCCGCCCTGAAATAAGTT 
GGTGAATATGGTTTTAATTCCTCTAATACTCACTGTACTGCCATGGTAGAGGTGAAAAAAAC 
AATTTTAGAAATATTATAATGGATTAAGCTGTCCAAGTTGGTCGTATTTTCTTTACATTTTA 
35 TTAACTAATAAACATAAATAAGTTCAACTATTTATTGACTAGTAATAATACGTGTAAAATGT 
CTATTGGTTTAAAATATGGGCCATAAGGCCCAGACTTGAAAAAAAAACTTGAAACCCAAAGT 
TATATTTTTACTTGTTTCTTCTTTCTCAGTGAATATCTCCCAATCAAGCTTCTTCGATTTTG 
CTCTCTCTTACACAGCCAATCGGTGTTTTCGCAGCTTTCAGGTTTGTCTCAATCTCAAATTA 
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AATCGGAGTCAAGTAATAACAATTGATAAACCTAATTGTTTCCATTGTATTGTAAGATTTGA 

AATTTTGCTGTAGATCCGGAATCGAATTCTAGTTCTGGAATCGTTGATCTCGATGGAATTTT 

TTTTTTAAGATTTCTTCTTACACATTTGGTTCAAAAGATCACATAGTTTTATTTTAATTTGA 

TAAGTATGATGA.TTCTGCTAAGTGGCATTGGATAAAGTTTTCGTTTTTGCAATACGTCTAAA 

CTTGTCTATGTCTTGAATGAACTCTCTGAGTTGCTTAAAAAGTCTTGTGCTTTCTTTATTAC 

ACAGGCCTCAATCCAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTG 

CATTGAGTTTATCGCTATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCAC 

TAACTTCATAATCGGCTCTTGACGTTGTTGAGTGTAATTGAACAAGAATGTGTAGGCAGAAT 

TGTCGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTG 

TAAACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACGTATGATTTGCCT 

TCCTCTCTCATCATATTAGCTCAGTAATCTTTCATCTCCTGTGTAGATCACCCACTAATAGT 

TTGAGTTTGCTAAGCTGATTATGGTCTGATTCATGGCGAGTGTGTGCTTCTTTTGTCTCCTA 

AATTTGAACTTGTTGTTTGTTGTTGCAGCCATCGTTTCTTCCAAGATGCTTGAACTACAAAA 

TTGGAGCAAAGCGCAAAAGAAAGTATGTTTTCTTCTTGAATGTAGCTGCTACAGTGATATGT 

TATTTATCTTACTTCTAATATGGAAGCTGATGACCTATTTTATCTTTGTTGAGTAGATATGG 

ACATAATGAATGGTTTCTTCTTTGTTCATGCTATAAACTTACATTTTATAAAATTGTGTTTT 

GGTTAGGTCAAGATCTACTGGGATGGTAGTTTTCAACTATAAGGATTGTAATAACACATTAC 

AGAAAACTGAAGGTTAGTCTTTTTCTGTTCGTCGACAAAATTCGATGTCAATGTCTATGTTT 

CTCTAGATGATTTGTTATTTACTATTTTTTTCTGTATTGTCATGCAGTTAGGGAGGATTGTT 

CTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGTGGGCAAGTATTACAACTGAGGT 

TTCTTCCGGGGCCTTTCATATCTAACACTGTGAAATGCTACTGCTGTTTCATGCTGTATACT 

TTCACTGTTTGGTTACATATTTTTGTGTTTGTTGTTTGTCTTCTCACTCTTTTCGAACTGCT 

GAGTGTGTGCTTATCTGAGAAAACATGTCCCAGATGGAGCTTACAACCAATTGTCTTGTGTC 

TATGCAGGGGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAAGG 

TATGTGGTTTTATGGAAATTCTTGATTTGCTATGC-CTTTATTAATGAGGTTATAGTTAAAAA 

AGGGTCTTTCCTATTGTAGCTTTCGGAAGAATACCAGACAGTTAATGTTTCTGTAAAACTTA 

ATTCCTTCATATTTGAGGTCAGTTACTTTAAACTTGGTTAATTGGGAAATCCGATAGCTGGT 

GAAAATTTTGTTTATATTCCATCCTTATTTGTACTAGGAAGAAGGAAGTGATGACGATAAAT 

TTGAGCCCTTCTCTCTCTGGTAACCCTCAGAACCCCTTCGATTAAATACCTTAATAGCAGTA 

ACTCCTTGCTTCTCTTGTCAGTACATCTCTGTAAATCCAACCATAATGTTTTGCAGCTCGAA 

ACCTCGTAAGCGGAGACAAAGAGGTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTTTT 

TACCGTTGGATTCACCCAGTTTAACTAATGGCACAGAAAATGGAATCACCCTACTTAATGAT 

GGTAAAATCATATCTTCTTCTGTGCGTTCCTTGTGGCTTAGAACTTCATATTACAGAAGAAG 

ATACAATGGCCTGATTGTTTAGTTTTTGTACTTCTCCTCGCATTCTTCTTGCGAGGGTATTG 

TTACCAGAACTGATGTACAAAATTAATGGCATGCTACAGGAAACCGTGGTTTAGGATATCCC 

GAGGCAACAGARCTTGCTGGACAATTTGAGATGACCAGCAACATTCCACCAGCCATAGCCCA 

CTCTTCTCTGGACGCTGGTGCTAAAGTTATATTGACAAGCGAAGCTGTGGTCCCTGCTACTA 

AGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGGTTTGTTCATCATGACACCCCG 
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TCATCATAATTACCATTCCTGTTGTTACAAATGTTCTTCCTATTATGGATAAGTGTTTATAG 

TACTGCCATATTAACCGAGAAAATTTCTTCCAGCCACCTACTTCTTCAGAAACGCCAATTCT 

ATCATTCTCACAGAGTCCAGGTGATCCAAGTTCCTTCACCTACTTCTTAGGCATTTTCTTTA 

AATTGCTCATGATGATATCTTATCAAAGCATACTTGGTTTGTTCTCATCCAAATTTGTATTT 

TGATCTGTATGTATCAACGCAAAATAGTTATGTCCATGTTGTCTCCGTTTTATTGCCACTAA 

CCAAAAAATGCATGTTTCTGTGACAAGCCAATGGCGCTTGAGCAAGTAATGTCTGACCGGGA 

TAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGGTATTCCATGATT 

TCTTTCTGCGTTCATTAAATAGACAACAGAAAATGGTATATGATGTAACTTGCTAATGGCTT 

TTGAAACTTAAAAAAGCTGCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATGAAAAGC 

AATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGTAACTACTTCTCTTACACATG 

AACAGACACAAAAAGACCTTATGTCTTACATTCCATACCTGTCTAAATGATTTTGCTTATGG 

AACTTTGAGCTCAATTATGATTGTTGATGTTTCAGGGTTATAGCAGATGGTCATATTTCTTG 

GGCATGTGAAGCATTTTCAAGATTTTACGAGAAAGAGTTGCACCGTTACTCATCACTCTTCT 

GGTAATATAAGTACACCAAACATATACAGACACATAACTACACTATCAATCTTGTTTCGTTT 

TCTGAAAAAAAAATAAAAATTTCCAGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCA 

TGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATA 

GCTCAGACACCACCACCACCAACAACAACAACAGTGTGGATCGTCCCAGTGACTCAAACACC 

AAC^CAATAACATTGTGGRTCATCCCAATGACATAAACAACAAGAACAATGTTGACAACAA 

GGACAATAACAGCAGAGACAAAGTAATTAAATAGGAAAATCTCCGGCTTTTATGATACCGAT 

TTATCGGATTGTAACTTATTCTTCTTTCTTAAAAAATTGTTTAGGAGCAAACAAATTTTTTA 

TATGTTAGTGTATTCAACTGATTACATTTTTAGTTAAAAAAAAAAATGGATTCTGCTTATAA 

CTAAAAACTGAAAAAAAAGAAAAGTTTCCTTAATTTTTCTTTTTGACTTGAGAAAAAGCTCC 

TCTAGTAAATATGAGTTATATATTAATCAAGTACATAACATAAAAATAGTATATATTAAGTG 

CAAATAGATTGAAAACAAATCAAGAAGAAATTAATTAAGACAGAGTGATTAAGCTTAAAACC 

CCATTTGGACTTGTTCTTTCTCAATGAATCCCTCACAAGCAGCAAGCTTCTTCGATTTTGCT 

TTGACACCACCAATCGGTGTTTTCGAATCTTTCAGGTTTGTCTCGATTTCAATCTAGATCGG 

AGTCAAGTAATAAAATTGATTAACCTAAGTATTCCCGTTCTCTCGTAAGAGTTGGGATTTAG 

CAGTAGATCGGAAATCGGAATTTACGTTTTTGTTAAAAGATTGATGGTTTAGGTAATGGAAC 

ATAGTTCTGGATTCATTGCTTCTAGTTGATTCTCGAATTGTTTGATTTCGCAATGCACATTT 

TTGTTTCAAAGGATCACAGAATTTGATTTAAAATTTGACAAAATTCCATCAATTTCTCATAT 

TAGGGTTTATATTTCTTCTAGTAACTCGAACTTGTTGGAACTCTGTATACTCTGTGCTATGT 

AGATAAAGTCTTAACATTTTGGTCAACTTTGTTTGATCTCTAAACTAGTTTGGGCTCTCTGT 

TTTAAAGTTTTGTGCTTTCACTATTACACAGGTCTCATACAAGACTACAGTCTCAAGAAGCA 

TAATATCGTCGACTCTGTTTTGAGTTTCTCAACAGTGGTTGAAGCTTAAGGAGGTTCTTATG 

TGCGTTTTGATATC 
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SEQ ID NO: 4 

CAAGCTTCTTCAATTTTGCTTGCTCTCTCTCTTACACGGCCAATCGGTGTTTTCGCAGCTTT 
CAGGCCTC^ATACAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGC 
ATGGAGTTGAACAATATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACT 
AACTTCATAATCAGCTCTGGACGTTGTTGATTGTATTTGAACAAGAATGTGTAGGCAGAATT 
GTCGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGT 
AAACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACCCATCGTTTCTGCC 
AAGATGCTTGAACTACAAAATTGGGGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGATGG 
TAGTTTTCAACTATAAGGATTGTAATAATACATTACAAAGAACTGAAGTTAGGGAGGATTGT 
TCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGGGCTGCAATTTCATTTGAATTC 
ATCTCATGATTTATTTGAATTTGAGTTCAAGCTTTTGGAAGAATACCAGACAGTTAATGTTT 
CTGTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAGTGATGATGATAAATTTGAGCCC 
TTCTCTCTCTGCTCGAAACCTCGTAAGCGTAGACAAAGAGGTGGCAGAAATAACACCAGGAG 
ACTTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTAGCTAATGGCACAGAAAATGGAA 
TTGCCCTGCTGAATGATGGAAACCGTGGTTTAGGATATCCCGAGGCAACAGAGCTTGCTGGA 
CAATTTGAGATGACTAGCAACATTCCACCAGCCATAGCCCACTCTTCTCTGGACGCTGGTGC 
TAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTACTAAGACAAGAAAGTTATCTGCTG 
AGCGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACGCCAATTCTATCATTCTCACAGA 
GTCCAGCCAATGGCGCTTGAGCAAGTAATGTCTGATCGGGATAGCGAGGATGAAGTCGATGA 
CGATGTTGCAGATTTTGAAGATCGCCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATG 
AAAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGGTTATAGCAGATGGT 
CATATCTCTTGGGCATGTGAAGTATTTTCAAGATTTTACGAGAAAGAGTTGCACTGTTACTC 
ATCACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTATGGAACCATGGACTTGTCGACT 
CAGCCACGATCAACAACTGCAATACCATCCTCGAGAATTGCCGTAATACCTCAGTCACTAAC 
AACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACACCAACAACAATAACATTGTGGA 
TCATCCGAATGACATAAAAAACAAGAACAATGTTGACAACAAGGACAATAACAGCAGAGACA 
AGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGATCTATCGGATTGTAACTTATTC 
TTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTTTATTTGTTAGTGTATTCAACTG 
ATTACATTTTTAGTTAAAAAAATGGATTCTCCTTAATAACT 



SEQ ID NO: 5 

MCRQNCRAKSSPEEVISTDENLLIYCKPVRLYNIFHLRSLGNPSFLPRCLNYKIGAKRKRKS 
RSTGMWFOTKDCNNTLQRTEVREDCSCPFCSMLCGSFKGLQFHLNSSHDLFEFEFKLLEEY 
QTVNVS VKLNS F I FEEEGSDDDKFEPFSLCSKPRKRRQRGGRNNTRRLKVCFLPLDSPSLAN 
GTENGIALLNDGNRGLGYPEATELAGQFEMTSNIPPAIAHSSLDAGAKVILTTEAWPATKT 
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RKLSAERSEARSHLLLQKRQFYHSHRVQPMALEQVMSDRDSEDEVDDDVADFEDRQMLDDFV 
DVNKDE KQFMHLWNS F VRKQRV I ADGH I SWACEVFSRF YEKELHCYSSLFWCWRLFL I KLWN 
HGLVD SAT INNCNT I LENCRNT SVTNNNNNS VDHP SD SNTNNNN I VDH PND I KNKOTtfVDNKD 
NNSRDK 

5 

SEQ ID NO: 6 

AAAGAGAAGAGCTTTGACTCTCTCATTGGTCAAACCTGACTGTATTTATATGCGTTATTGTG 
1 0 TGGTAAAGTTTCGACCTTTGACTTGACAAGTTGCCGTTAAGAAGAGAGATGCGTAGATCAGC 
GAGTGGTTCTAGAGTTTTGGATCATTTTCCGGCGACTTCAAGGTCTCCGCCTCGATCTCAGA 
GTGTTACAGCAATGGAAGATGATGTGGAGCTGCTTTTGCCTAGGTACGATCCGAATTCTCAA 
GCGGGGAAGAGAGAGAAATCAAGATTCAGATTTTCAGAAAACGTCATCCATTTGATTCCTCT 
CATTCTTCTTCTCTGTGTCGCAATCCTCTGGCTCTCCTCTTACTCAGGTAAGCCGAGAAATT 
15 GTTTCAATCTCTATGAATCCATAATTGATCTGTGAAACTTAATTAGGGATTTTACAAAGACT 
CATATGGATATGAGGATCGAGATGTCTCTGCAACGTTAGAATCTTGTGTTGAATTATGGTTT 
CAATTTGTTCATATAATACTAAATCGGTGATGGATTTGGAATTTGTCAGCAGCGTTAAGGAG 
TTGAGTTCCAAAAGCAACATGTTGTCTTGTCTCCATGGGAACTCATATTCAGTTTTGGGAAA 
GGAAACAATTCTTTTACCGCCGGTGATTTTGTGCCGCAAACCATTCGTATTTGTAATTTTTG 
20 GTTCTGTAGACACACAAAAGGATCTCTCGTTTTCATGAAATGTATGTTTAATATTTCAGTGA 
TATACATCACACAACTCAAGTAGAAAACACTGATGGTTATCCATTAATCATTCTATTGGTCG 
AAAAAAAGATTAGTTTCAAGTTAATGCCACCTTAGGATTATATGTTCCTGTGAGTTTCAGCT 
AGCCAACTCAACTAGAGTTAAACAATGGAATCAAAATACATATTCAGTAATTTATTTTAAAC 
TCTGACTATTTATGTAAAACACAAATGGAAATCAAAATTGAAGGTCATGAAGATTCTATTCT 

2 5 TAGTGTGAAAAGTATAGATCAATGATTCTTAATTTCTTCATCCTCCACGCATAGATCAATGG 

TGAA.TATGGTTTTAAATCCTCTAATACTCACTGTACTGCCATGGTAGAGTTAAAAAAACAAT 
TTTAGAAATATTAGTGGATTAAGGCATTAAGCTGTCCAAGTTGCTTGTATTTTCTTTTCATT 
TTATTAATTAAAAAAAAAGTTCAACTATTTATTGACTAATAATAATACGTGTTAAATGGTTA 
TCGGTTTAAAATATGGGCCATAGGCCCAGACTTGAAGAAAAACTTGAAACCCAAAGTTTTAT 

3 0 TTTTACTTGTTTTCTTTCTCAGTGAATATCTCCCAATCAAGCTTCTTCAATTTTGCTTGCTC 

TCTCTCTTACACGGCCAATCGGTGTTTTCGCAGCTTTCAGGTTTGTCTCAATCTCAAATTAA 
ATCGGAGTCAAGTAATAACAATTGATAACCCTAATTGTTTCAATTATATTGTAAGATTTGAA 

ATTTTGCAGTAGATCCGGAATCGTATTCTAGTTCTGGAAT CGTTGAT CTCGATGGAATTTTT 
TTTAAGATTTCTTCATACACATTTGGTTCAAAAGATCACATAATTTTATTTTAATTTGATAA 
3 5 GTATGATGATTCTGCTAAGTGGCATTGGATAAAGTTTTCATTTTTGCAATACGTCTAAACTT 
GTCTATGTCTTGAATGAACTCTCTGAGTTGCTTAAAAAGTCTTGTGCTTTCTTTATTACACA 
GGCCTCAATACAAGACATTCTATATAAGCATATTGCAGAAGAGGCGGTTCTAATTGTTGCAT 
GGAGTTGAACAATATGACGTAGGGAAATTCTAATTTAGGGGAGGCCTCAGAGTTTGCACTAA 
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CTTCATAATCAGCTCTGGACGTTGTTGATTGTATTTGAACAAGAATGTGTAGGCAGAATTGT 
CGCGCGAAATCCTCACCGGAGGAAGTGATTTCAACTGATGAGAATCTCTTGATATATTGTAA 
ACCTGTTCGACTATATAACATCTTTCACCTTCGCTCTCTAGGCAACGTATGATTTGGCCTTC 
CTCTCTCATCATTTTAGCTTAGTAATCTTTCATCTCCTGTGTAGATCACCCACTAATAGTTT 
GAGTTTGCTAAGCTGATTATGGTCTGACTCATGGCGAGTGTGTGCTTCTTTTGTCTCCTAAT 
GTTATTTGAACTTGTTGTTTGTTGTTGCAGCCATCGTTTCTGCCAAGATGCTTGAACTACAA 
AATTGGGGCAAAGCGCAAAAGAAAGTATGCGTTTCTTCTTGAATGTAGTTGCCACAGTGATA 
TGTTATTTATCTTACTTCTAATATGGAAGCTGATGAACTATTTATCTTTGTTGAGTAGATAT 
GGACATAATGAATGGTTTCTTCTTTGTTCATGCTATACACTTATATTTTACAAAATTGTGTT 
TTGCTTAGGTCAAGATCTACTGGGATGGTAGTTTTCAACTATAAGGATTGTAATAATACATT 
ACAAAGAACTGAAGGTTAGTCTTTTTCTGTTCTTCGACAAAATTCGATGTCAATGTCTATGT 
TTCTCTAGATGATTTGTTATTTACTATTTTTTTCTGTATTGTCACGCAGTTAGGGAGGATTG 
TTCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGTGGGCAACTATTACAACTGAG 
GTTTCTTCCGGGGCCTTTCATATCTAACACTGTGAAATGCTACTGCCGTTTAATGCTATATA 
CTTTCACTGTTTGGTTACATATTTTTGTGTTTGTTGTTTGTCTTCTTGCTCTTTTTAAACTG 
CTGAGTGTGTGCTTATCTGAGAAAACATGTTCCAGTTCGAGCTTACAATCCATTGTCTTGTG 
TCTATGCAGGGGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAA 
GGTATGTGGTTTTATGGAATTTCTTGTTTTGCCTATGCCGTTAGTAATGAGGTTATAGTTAA 
AAAAGGGTCTTTCCTATTGTAGCTTTTGGAAGAATACCAGACAGTTAATGTTTCTGTAAAAC 
TTAATTCCTTCATATTTGAGGTCAGTTACTTTAAACTTGGTTAATTGGGAAATCCTATAGCT 
GGTGAAAATTTGGTTTATATTCCATCCTTATTTGTACTAGGAAGAAGGAAGTGATGATGATA 
AATTTGAGCCCTTCTCTCTCTGGTAACTCTCAGAACCCCTTGATTAAATACCTTAATAGCAG 
TAACTCCTTGCTTTTCTTGTCAGTACTTCTCTATAAATCCAACCACAATGTTTTGCAGCTCG 
AAACCTCGTAAGCGTAGACAAAGAGGTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTT 
TTTACCGTTGGATTCACCCAGTTTAGCTAATGGCACAGAAAATGGAATTGCCCTGCTGAATG 
ATGGTAAAATCACATCTTCTTCTGTGGTATTCGTTGTGGCTTAGAACTTCATTTTACAGAAG 
AAGATACAATGTCCTGATTGTTTAGTTTTTGTACTTCTCCTCGCATTCTTCTTGTGAGGGTA 
ATGTTACCAGAACTGATGTACAAAATTAATGGCATGCTACAGGAAACCGTGGTTTAGGATAT 
CCCGAGGCAACAGAGCTTGCTGGACAATTTGAGATGACTAGCAACATTCCACCAGCCATAGC 
CCACTCTTCTCTGGACGCTGGTGCTAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTA 
CTAAGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGGTTTGTTCATCATGACACC 
CCGTCATCATAATTACCATACCTGTTGTTACAAATGTTGTTCCTATTATGGATAAGTGTTTA 
CTGTACTGCCATATTAACCGAGAAAATTTCTTCCAGCCACCTACTTCTTCAGAAACGCCAAT 
TCTATCATTCTCACAGAGTCCAGGTGATCCAAGTTCCTTCACCTACTTCTTAGGCATTTTCT 
TTAAATTGCTCATGATGATATCTTATCAAAGCATACTTGGTTTGTTCTCATCTAAATTTGTA 
TTTTGATTCTGTATGTATCAACGCAAAAAAATTATGTCCATGTTGTCTCCGTTTTATTGCCA 
CTAACCAAAAACTGCATGTTTCTTGTGACAAGCCAATGGCGCTTGAGCAAGTAATGTCTGAT 
CGGGATAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGGTATTCCA 
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TGATTTCTTTCTGCGTTCATTAAGTAGGCAACAGAAAATGGTATACGATGTJiACTTGCTAAT 
GGCTTTTGAAACTTAAAAAAGCTGCAGATGCTTGATGACTTTGTGGATGTGAATAAAGATGA 
AAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAAACAAAGGTAACTACTTCTCTTAC 
ACTTGAACACACACAAAAAGACCTTATGTCTTACATTCCATACCTGTCTAAATGATTCTGCT 
5 TATGGAACTTTGAGCTCAAATTATGATTGATGTTTGCAGGGTTATAGCAGATGGTCATATCT 
CTTGGGCATGTGAAGTATTTTCAAGATTTTACGAGAAAGAGTTGCACTGTTACTCATCACTC 
TTCTGGTAATATAAGTACACCAAACATATACAGACACATAACTACACTATCAATTTTGTTTC 
GTTTTTCTGAAAGAAAAATAAAAAATTCCAGGTGTTGGAGATTGTTTTTGATTAAACTATGG 
AACCATGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTGCCG 

1 0 TAATACCTCAGTCACTAACAACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACACCA 
ACAACAATAACATTGTGGATCATCCGAATGACATAAAAAACAAGAACAATGTTGACAACAAG 
GACAATAACAGCAGAGACAAGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGATCT 
ATCGGATTGTAACTTATTCTTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTTTAT 
TTGTTAGTGTATTCAACTGATTAGATTTTTAGTTAAAAAAATGGATTCTCCTTAATAACTAA 

1 5 AGACTGAAAAATAAGATAAGTTTCCTTAATTTTTCTTTTTGACTTGAGAAAAAGCTCCTCTA 
GACCTCTAGTAAATAGGAGTTATATATTAATCAAGTACATAACATAAAAATATATATATTAA 
GTGCAAATAGATTGAAAACAAATCAAGAAATTAATTAAGACACAGTGATTAAGCTTAAAACC 

ccattttgacttgttctttctcaatgaatccctcacaagcagcaagcttcttcgattttgct 
ttgacaccaccaatcagtgttttcgaatctttcaggtttgtctcgatttcaaactagatcgg 

2 o agtcaagtgataaaattgactaacataattattcccgttctctcgtaagagttgggatttag 

cagtagatcggaaatcggaatttacgtttttgttaaaagattgatggtttaggtaataaaac 
atagttctggattcattgcttctagttgattctcgaattgtttgatttcgcaatgcacattt 
ttggttgaaaggatcac^taatttgctttaaaatttgacaaaacataccatcaaatttctca 
tatttcttcaagtaactcgaacttgttggaaatctatatactctgggctatgtagataaagt 
25 cttaacattttggtcaacattgtttgttctctaaactagtttgggttctctgttttaaagtt 
tggtgctttcactattacacaggtcttatacaagactacagtctctagaagcataatatcgt 
cgactctgttttgagtttcccaacagtggttgaagcttaaggaggttcttatgtgccttttg 

AAATC 

30 

SEQ ID NO: 7 

caagcttcttcaattttgcttgctctctctcttacacggccaatcggtgttttcgcagcttt 
caggcctcaatacaagacattctatataagcatattgcagaagaggcggttctaattgttgc 

3 5 atggagttgaacaatatgacgtagggaaattctaatttaggggaggcctcagagtttgcact 

aacttcataatcagctctggacgttgttgattgtatttgaacaagaatgtgtaggcagaatt 
gtcgcgcgaaatcctcaccggaggaagtgatttcaactgatgagaatctcttgatatattgt 
aaacctgttcgactatataacatctttcaccttcgctctctaggcaacccatcgtttctgcc 



SUBSTITUTE SHEET (RULE 26) 



WO 00/44918 



75 



PCT/GBOO/00248 



AAGATGCTTGAACTACAAAATTGGGGCAAAGCGCAAAAGAAAGTCAAGATCTACTGGGATGG 
TAGTTTTCAACTATAAGGATTGTAATAATACATTACAAAGAACTGAAGTTAGGGAGGATTGT 
TCTTGTCCATTTTGCTCTATGCTATGTGGTAGCTTCAAGGTGGGCAACTATTACAACTGAGG 
GGCTGCAATTTCATTTGAATTCATCTCATGATTTATTTGAATTTGAGTTCAAGCTTTTGGAA 
GAATACCAGACAGTTAATGTTTCTGTAAAACTTAATTCCTTCATATTTGAGGAAGAAGGAAG 
TGATGATGATAAATTTGAGCCCTTCTCTCTCTGCTCGAAACCTCGTAAGCGTAGACAAAGAG 
GTGGCAGAAATAACACCAGGAGACTTAAAGTATGCTTTTTACCGTTGGATTCACCCAGTTTA 
GCTAATGGCACAGAAAATGGAATTGCCCTGCTGAATGATGGAAACCGTGGTTTAGGATATCC 
CGAGGCAACAGAGCTTGCTGGACAATTTGAGATGACTAGCAACATTCCACCAGCCATAGCCC 
ACTCTTCTCTGGACGCTGGTGCTAAAGTTATATTAACAACCGAAGCTGTGGTCCCTGCTACT 
AAGACAAGAAAGTTATCTGCTGAGCGATCAGAGGCTAGAAGCCACCTACTTCTTCAGAAACG 
CCAATTCTATCATTCTCACAGAGTCCAGCCAATGGCGCTTGAGCAAGTAATGTCTGATCGGG 
ATAGCGAGGATGAAGTCGATGACGATGTTGCAGATTTTGAAGATCGCCAGATGCTTGATGAC 
TTTGTGGATGTGAATAAAGATGAAAAGCAATTCATGCATCTTTGGAACTCGTTTGTAAGAAA 
ACAAAGGGTTATAGCAGATGGTCATATCTCTTGGGCATGTGAAGTATTTTCAAGATTTTACG 
AGAAAGAGTTGCACTGTTACTCATCACTCTTCTGGTGTTGGAGATTGTTTTTGATTAAACTA 
TGGAACCATGGACTTGTCGACTCAGCCACCATCAACAACTGCAATACCATCCTCGAGAATTG 
CCGTAATACCTCAGTCACTAACAACAACAACAACAGTGTGGATCATCCCAGTGACTCAAACA 
CCAACAACAATAAC^TTGTGGATCATCCGAATGACATAAAAAACAAGAACAATGTTGACAAC 
AAGGACAATAACAGCAGAGACAAGTAATTAAATAGGAAACACTCCGGTTTAGATGATACCGA 
TCTATCGGATTGTAACTTATTCTTCTTTCTTAAAAAAATTGTTTAGGAGCAAACAAAGATTT 
TATTTGTTAGTGTATTCAACTGATTACATTTTTAGTTAAAAAAATGGATTCTCCTTAATAAC 
T 



SEQ ID NO: 8 

MCRQNCRAKSS PEEVI STDENLL I YCKPVRLYNI FHLRSLGNPSFLPRCLNYKI GAKRKRKS 
RSTGiyTWFNYKDCNISrrLQRTEWEDCSCPFCSMLCGSFKVGNYYN 



SEQ ID NO: 9 

ACATTTTCGTACCGCTCAAGATTTAAGAAGCGTAAAAGGGTGGAAATCTCAAGTGATAAAAT 
TAGGCATGTACATCCACATATTGTGGATTCAGGATCACCTGAAGATGCCCAGGCAGGATCTG 
AAGACGATTACGTGCAGAGGGAAAATGGTAGTTCTGTAGCACACGCTTCTGTTGATCCTGCT 
AATTCATTACACGGTAGCAATCTTTCAGCACCAACAGTGTTACAGTTTGGGAAGACAAGAAA 
GCTGTCTGTTGAACGAGCTGATCCCAGAAATCGGCAGCTCCTACAAAAACGCCAGTTCTTTC 
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ATTCTCACAGGGCTCAACCAATGGCATTGGGAGCAGTTTTCTCAGATCGTGATAGTGAAGAT 

GAGGTTGATGATGACATTGCTGATTTTGAAGATAGACAGATGCTTGATGATTTTGTTGATGT 
TACCAAAGACGAACTTATTATGCATATGG 



SEQ ID NO: 10 

TFSYRSRFKKRKRVEISSDKIRHVHPHIVDSGSPEDAQAGSEDDYVQRENGSSVAHASVDPA 
NSLHGSNLSAPTVLQFGKTRKLSVERADPRNRQLLQKRQFFHSHRAQPMALGAVFSDRDSED 
EVDDD I ADFEDRQMLDDF VDVTKDEL I MHM 



SEQ ID NO: 11 

ACATGCATATCCTGATGCTGAATGTGCTCAATTGGTACCTGGGAATAATCTTGCACCTCCTG 

CCATGCTACAATTTGCAAAGACAAGAAAATTATCAATTGAACGGTCTGACATGAGAAACCGT 

ACACTCCTTCACAAACGACAATTTTTTCACTCACATAGAGCTCAGCCAATGGCAGCTGAGCA 

AGTTATGTCAGATCGGGATAGTGAGGATGAAGTTGACGATGATGTTGCAGATTTTGAAGACC 

GAAGGATGCTTGATGATTTTGTAGACGTGACTAAAGATGAGAAGCAAATGATGCACTTGTGG 
AACTCATTTGTGAGG 



SEQ ID NO: 12 

HAYPDAECAQLVPGNNLAPPAMLQFAKTRKLS I ERSDMRNRTLLHKRQFFHSHRAQPMAAEQ 
VMSDRDSEDEVDDDVADFEDRRMLDDFVDVTKDEKQMMHLWNSFVR 

SEQ ID NO: 13 

ATGGATCCGATTAAGCTGACAACAGAAGCTAAGGTCCCTGCTAAGCGATCAAAGGCTACAAG 
CCACTACTTGCCTCTTCATAAACGCCAGTTCTATCATTCCCGAACCGGTCAGCCATTGTCAC 
TTGAGCAAGTTATGTCTGACCGAGATAGCGAAAATGACGTCGACAAAAATGATGATGCTGCA 
CATCTCGAAGAAAGCCAGATGCTTAATGGTTCCATGGATGAGAATGAAATCGTAGCAGAGAG 
ATTCATAAAACTTTGGAACTCCTTTGTTAAACAGCAAAGGATTGTTGCAGATGCTCATATTC 
CTTGGGCATGTGAAGCATTCTCAAGATTACACCTGCAAGAGCTGCGCAGTAACTTATCACTC 
GACTTGTGCTGGAGACAATTCATGATCAAACAATGGGATTATGGACTTCTTGACAGAGTCAC 
CATGAACAAATGCAATACCATCATCTACCATAATATCTCAACTACCAA.CGATGACATAAACA 
ATAACAACACAAGGACGACTGATAATATGGATGTTGTCGACGATGACATAAACAGAGACAAG 
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SEQ ID NO: 14 

MDP I KLTTEAKVPAKRSKATSHYLPLHKRQFYHSRTGQPLS LEQVMSDRDSENDVDKNDDAA 
5 HLEESQMLNGSMDENEIVAERFIKLWNSFVKQQRIVADAHIPWACEAFSRLHLQELRSNLSL 
. DLCWRQFMI KQWDYGLLDRVTMNKCNTI I YHNISTTNDDINNNNTRTTDNMDWDDDINRDK 



SEQ ID NO: 15 

10 

CTCTGAGGAGACACTTTTTTTTTCCTCCCTCCTTCCCTCCTCTCCTCCTCCCTTCCCTTCCC 
CTCTCCTCCCCTCTCTCCTCCTTCCCCCCTCGGTCCGCCGGAGCCTGCTGGGGCGAGCGGTT 
GGTATTGCAGGCGCTTGCTCTCCGGGGCCGCCCGGCGGGTAGCTGGCGGGGGGAGGAGGCAG 
GAACCGCGATGGCGCCTCAGAAGCACGGCGGTGGGGGAGGGGGCGGCTCGGGGCCCAGCGCG 
1 5 GGGTCCGGGGGAGGCGGCTTCGGGGGTTCGGCGGCGGTGGCGGCGGCGACGGCTTCGGGCGG 
CAAATCCGGCGGCGGGAGCTGTGGAGGGGGTGGCAGTTACTCGGCCTCCTCCTCCTCCTCCG 
CGGCGGCAGCGGCGGGGGCTGCGGTGTTACCGGTGAAGAAGCCGAAAATGGAGCACGTCCAG 
GCTGACCACGAGCTTTTCCTCCAGGCCTTTGAGAAGCCAACACAGATCTATAGATTTCTTCG 
AACTCGGAATCTCATAGCACCAATATTTTTGCACAGAACTCTTACTTACATGTCTCATCGAA 

2 0 ACTCCAGAACAAACATC^AAAGGAAAACATTTAAAGTTGATGATATGTTATCAAAAGTAGAG 

AAAATGAAAGGAGAGCAAGAATCTCATAGCTTGTCAGCTCATTTGCAGCTTACGTTTACTGG 
TTTCTTCCAGAAAAATGATAAGCCATCACGAAACTCAGAAI^TGAACAAAATTCTGTTACCC 
TGGAAGTCCTGCTTGTGAAAGTTTGCCACAAAAAAAGAAAGGATGTAAGTTGTCCAATAAGG 
C^^GTTCCCACAGGTAAAAAGCAGGTGCCTTTGATTCCTGACCTCAATC^AACAAAACCCGG 
25 AAATTTCCCGTCCCTTGCAGTTTCCAGTAATGAATTTGAACCTAGTAACAGCCATATGGTGA 
AGTCTTACTCGTTGCTATTTAGAGTGACTCGTCCAGGAAGAAGAGAGTTTAATGGAATGATT 
AATGGAGAAACCAATGAAAATATTGATGTCAATGAAGAGCTTCCAGCCAGAAGAAAACGAAA 
TCGTGAGGATGGGGAAAAGACATTTGTTGCACAAATGACAGTATTTGATAAAAACAGGCGCT 
TACAGCTTTTAGATGGGGAATATGAAGTAGCCATGCAGGAAATGGAAGAATGTCCAATAAGC 

3 0 AAGAAAAGAGCAACATGGGAGACTATTCTTGATGGGAAGAGGCTGCCTCCATTCGAAACATT 

TTCTCAGGGACCTACGTTGCAGTTCACTCTTCGTTGGACAGGAGAGACCAATGATAAATCTA 
CGGCTCCTATTGCCAAACCTCTTGCCACTAGAAATTCAGAGAGTCTCCATCAGGAAAACAAG 
CCTGGTTCAGTTAAACCTACTCAAACTATTGCTGTTAAAGAATCATTGACTACAGATCTACA 
AACAAGAAAAGAAAAGGATACTCCAAATGAAAACCGACAAAAATTAAGAATATTTTATCAGT 
3 5 TTCTCTATAACAACAATACAAGGCAACAAACTGAAGCAAGAGATGACCTGCATTGCCCTTGG 
TGTACTCTGAACTGCCGCAAACTTTATAGTTTACTCAAGCATCTTAAACTCTGCCATAGCAG 
ATTTATCTTCAACTATGTTTATCATCCAAAAGGTGCTAGGATAGATGTTTCTATCAATGAGT 
GTTATGATGGCTCCTATGCAGGAAATCCTCAGGATATTCATCGCCAACCTGGATTTGCTTTT 
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AGTCGCAACGGACCAGTTAAGAGAACACCTATCACACATATTCTTGTGTGCAGGCCAAAACG 
AAC^AAAGCAAGCATGTCTGAATTTCTTGAATCTGAAGATGGGGAAGTAGAACAGCAAAGAA 
CATATAGTAGTGGCCACAATCGTCTGTATTTCCATAGTGATACCTGCTTACCTCTCCGTCCA 
CAAGAAATGGAAGTAGATAGTGAAGATGAAAAGGATCCTGAATGGCTAAGAGAAAAAACCAT 
5 TACACAAATTGAAGAGTTTTCTGATGTTAATGAAGGAGAGAAAGAAGTGATGAAACTCTGGA 
ATCTCCATGTCATGAAGCATGGGTTTATTGCTGACAATCAAATGAATCATGCCTGTATGCTG 
TTTGTAGAAAATTATGGACAGAAAATAATTAAGAAGAATTTATGTCGAAACTTCATGCTTCA 
TCTAGTCAGCATGCATGACTTTAATCTTATTAGCATAATGTCAATAGATAAAGCTGTTACCA 
AGCTCCGTGAAATGCAGCAAAAATTAGAAAAGGGGGAATCTGCTTCCCCTGCAAACGAAGAA 

1 0 ATAACTGAAGAAGAAAATGGGACAGCAAATGGATTTAGTGAAATTAACTCAAAAGAGAAAGC 
TTTGGAAACAGATAGTGTCTCAGGGGTTTCAAAACAGAGCAAAAAACAAAAACTCTGAAAAG 
CTCTAACCCCATGTTATGGACAAACACTGAAATTACATTTTAGGGAATTCATCCTCTAAGAA 
TTATGTTTTTGTTTTTAATCATATGTTCCAAACAGGCACTGTTAGATGAAGTAAATGATTTC 
AACAAGGATATTTGTATCAGGGTTCTACTTCACTTCATTATGCAGCATTACATGTATATCAC 

1 5 TTTTATTGATGTCATTAAAA.CATTCTGTACTTTAAGCATGAAAAGCAATATTTCAAAGTATT 
TTTAAACTCAACAAATGTCATCAAATATGTTGAATTGATCTAGAAATTATTTCATATATAAA 
TCAGAATTTTTTTGCATTTATGAACGGCTGTTTTTCTACTTTGTAATTGTGAGACATTTTCT 
TGGGGAGGGAAAATTGGAATGGTTCCCTTTTTTAGAAATTGAAGTGGTCTTCATATGTCAAC 
TACAGAAAAGGAAAAAAATAGAAATTGAAGGATTTTTATGAAATTATATTGCATTACTATTT 

2 0 GCAGTCAAACTTTGATCCTTGTTTTTGAAATCATTTGTCAATTCGGAATGAAAAATTATAAT 
GTAATTTTACATTACATAAGTTCCTTTTACAATTAAAAAATAGCACTTCTTCATCTTATGCC 
TGTTTGAGAAGATATTAAATTTTCACATTGTTGACAGTGAAATGCTATGTTGGTTTATAAGA 
TTACAGACCATTTGTTTTCATGTGGATAATTTTAGTGCATTGCTCACCCGGTATGTTTTTTT 
TTTTTAACTTGAACATTTTGCTTGTTTTGTTTTTCTTTTTTAATTAGATAATCACACGGAAA 

2 5 ATTAAGCTGTTCATATCTTTAAATTAGGATTGCAAACCAAGGAAAGAACGCATTTGAGATTT 

TAAGATGTCACTTATAAGGGGAGAAGTGTTCTTAAAAAGTCAACCAGAAAACTGTTATGCCT 
TTTATTTGTTTGCAAGGATGTCTTTGTAATGTGTTTCATGAATAGAATATCCAATAGAGATA 
AGCTGACTTGAATCATTTTGAGCAATTTTGCCGTGTGTTATATGTGTTTCACGCACATATTT 
GCAGTTGGATTTTCTCCAACAGAAAGTGGATTCACTACTGGCACATTAACAAGCACCAATAG 

3 0 GTTTTTATTCCAACTCCGAGCACTGTGGTTGAGTAACATCACCTCAATTTTTTATTATCCTT 

AAAGATATTGCATTTTCATATTCTTTATTTATAAAGGATCAATGCTGCTGTAAATACAGGTA 
TTTTTAATTTTAAAATTTCATTCCACCACCATCAGATGCAGTTCCCTATTTTGTTTAATGAA 
GGGATATATAAGCTTTCTAATGGTGTCTTCAGAAATTTATAAAATGTAAATACTGATTTGAC 
TGGTCTTTAAGATGTGTTTAACTGTGAGGCTATTTAACGAATAGTGTGGATGTGATTTGTCA 
35 TCCAGTATTAAGTTCTTAGTCATTGATTTTTGTGTTTAAAAAAAAATAGGAAAGAGGGAAAC 
TGCAGCTTTCATTACAGATTCCTTGATTGGTAAGCTCTCCAAATGATGAGTTCTAGTAAACT 
CTGATTTTTGCCTCTGGATAGTAGATCTCGAGCGTTTATCTCGGGCTTTAATTTGCTAAAGC 
TGTGCACATATGTAAAAAI^AAAAAAAAAAGATTATTTTAGGGGAGATGTAGGTGTAGAATT 
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ATTGCTTATGTCATTTCTTAAGCAGTTATGCTCTTAATGCTTAAAAGAAGGCTAGCATTGTT 
TGCACAAAAAGTTGGTGATTCCCACCCCAAATAGTAATAAAATTACTTCTGTTGAGTAAACT 
TTTTATGTCATCGTAAAAGCTGGAAAAATCCCTTTGTTTCTATTTATAAAAAAAGTGCTTTT 
CTATATGTACCCTTGATAACAGATTTTGAAGAAATCCTGTAAGATGATAAAGCATTTGAATG 
GTACAGTAGATGTAAAAAAAATTCAGTTTAAAAGAACATTTGTTTTTACATTAAATGTTTAT 
TTGAAATCAAATGATTTTGTACATAAAGTTCAATAATAT 



SEQ ID NO: 16 

LRRHFFFPPSFPPLLLPSLPLSSPLSSFPPRSAGACWGERLVLQALALRGRPAGSWRGEEAG 
TAMAPQKHGGGGGGGSGPSAGSGGGGFGGSAAVAAATASGGKSGGGSCGGGGSYSASSSSSA 
AAAAGAAVLPVKKPKMEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRN 
SRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTL 
EVLLVKVCHKKRKDVS CP I RQVPTGKKQVPL I PDLNQTKPGNFPSLAVS SNEFE P SNSHMVK 

SYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRL 
QLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKST 
APIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQF 
LYMNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDV 
YDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLESEDGEVEQQRT 
YS SGHNRLYFHSDTCLPLRPQEMEVDS EDEKDPEWLREKT I TQ I EE FSDVNEGEKEVMKLWN 
LHVMKHGF IADNQMNHACMLFVENYGQKI IKKNLCRNFMLHLVSMHDFNLISIMS IDKAVTK 
LREMQQKLEKGESAS PANEE ITEEQNGTANGFSE INS KEKALETDSVSGVSKQSKKQKL 
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CLAIMS : 

1. An isolated nucleic acid obtainable from the VRN2 locus 
of a plant, which nucleic acid encodes a polypeptide which is 
capable of affecting one or more physical characteristics of a 
plant into which the nucleic acid is introduced, the physical 
characteristics being selected from vernalization response, 
flowering time, leaf size and/or shape or shade avoidance 
response. 

2 . A nucleic acid according .to claim 1 which is capable of 
reducing the vernalization requirement of the plant for 
flowering. 

3 . A nucleic acid according to claim 1 or claim 2 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No . 2. 

4. A nucleic acid according to claim 3 wherein the VRN2 
nucleotide sequence consists of the sequence of SEQ ID No. 1 
or is degeneratively equivalent thereto. 

5 . A nucleic acid according to claim 1 or claim 2 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No. 5. 

6. A nucleic acid according to claim 5 wherein the VRN2 
nucleotide sequence consists of the sequence of SEQ ID No. 4 
or is degeneratively equivalent thereto. 

7. A nucleic acid according to claim 1 or claim 2 which 
comprises a VRN2 nucleotide sequence which encodes a 
polypeptide of SEQ ID No. 8. 

8. A nucleic acid according to claim 7 wherein the VRN2 
nucleotide sequence consists of the sequence of SEQ ID No. 7 
or is degeneratively equivalent thereto. 
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9 . An isolated nucleic acid obtainable from the VRN2 locus 
of a plant which nucleic acid comprises a nucleotide sequence 
of SEQ ID No. 3 or SEQ ID No. 6, or a nucleic acid 
degenerative equivalent thereof. 

5 

10 . A nucleic acid according to claim 9 which comprises a 
sequence having promoter and/or regulatory function. 

11. An isolated nucleic acid which comprises a variant 
10 sequence which is a homologous variant of a nucleotide 

sequence of any one of claims 4, 6 , 8, 9 or 10 and which 
shares at least about 50% identity therewith. 

12 . A nucleic acid according to claim 11 wherein the variant 
15 sequence is capable of capable of affecting one or more 

physical characteristics of a plant into which the nucleic 
acid is introduced, the physical characteristics being 
selected from vernalization response, flowering time, leaf 
size and/or shape or shade avoidance response 

20 

13 . A nucleic acid according to claim 11 or claim 12 wherein 
the variant is an allelic variant of a nucleotide sequence of 
any one of claims 4, 6, 8, 9 or 10, or is a nucleic acid 
degenerative equivalent thereof. 

25 

14 . A nucleic acid according to claim 11 or claim 12 wherein 
the variant sequence is a VRN2 sequence obtainable from a 
plant species other than Arabidopsis thaliana. 

30 15. An isolated nucleic acid sequence which comprises a 

derivative of a nucleotide sequence of any one of claims 4, 
6, 8, 9 or 10, which is a derivative by way of one or more of 
addition, insertion, deletion or substitution of a said 
nucleotide sequence and wherein the derivative sequence is 

35 either capable of affecting one or more physical 

characteristics of a plant into which the nucleic acid is 
introduced, the physical characteristics being selected from 
vernalization response, flowering time, leaf size and/or shape 
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or shade avoidance response or has promoter and/or regulatory 
function. 

16. An isolated nucleic acid which comprises a sequence which 
5 is a fragment of a sequence of any one of claims 1 to 15 and 

either is capable of affecting one or more physical 
characteristics of a plant into which the nucleic acid is 
introduced, the physical characteristics being selected from 
vernalization response, flowering time, leaf size and/or shape 
10 or shade avoidance response or has promoter and/or regulatory 
function. 

17. An isolated nucleic acid which comprises a sequence which 
is the complement of a sequence of any one of claims 1 to 16, 

15 

18. An isolated nucleic acid for use as a probe or primer 
which comprises a sequence that encodes an amino acid sequence 
that is partly, substantially or completely conserved between 
a VRN2 sequence of SEQ ID Nos . 2, 5 or 8 and at least one of 

20 the other sequences shown in Figure 8a or 8b. 

19 . An isolated nucleic acid for use as a probe or primer 
which comprises a sequence that is partly, substantially or 
completely conserved between two or more of the VRN2 

25 nucleotide sequences of claim 4, 6,8 9 or 10 or the 
complements thereof . 

20. A nucleic acid according to claim 18 or claim 19 which is 
15 to 40 nucleotides in length. 

30 

21. A nucleic acid according to claim 20 which is 19 to 28 
nucleotides in length. 

22. A pair of primers each comprising a nucleic acid 
35 according to any one of claims 18 to 21. 
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23. A pair of primers according to claim 22 which is selected 
from: 

VRN2-AP and VRN2-AJ; 
VRN2-A0 and VRN2-AS; and 
VRN2-AI and VRN2-AJ. 

24. A process for producing a nucleic acid which is a 
derivative according to claim 15 which process comprises the 
step of modifying 



25. A method for identifying or cloning a nucleic acid 
according to any one of claims 1 to 17, which method comprises 
using a nucleotide sequence of a probe or primer of claim 18 
or 19 in a database search. 

26. A method for identifying or cloning a nucleic acid 
according to any one of claims 1 to 17, which method employs a 
probe or primer according to any one of claims 18 to 21 or a 
pair of primers according to claim 22 or 23* 

27. A method for determining the presence of a nucleic acid 
according to any one of claims 1 to 17 within the genetic 
make-up of a plant, which method employs a probe or primer 
according to any one of claims 18 to 21 or a pair of primers 
according to claim 22 or 23. 

28. A method according to claim 26 or claim 27, which 
comprises the steps of : 

(a) providing a preparation of nucleic acid from a plant 

cell ; 

(b) providing a nucleic acid molecule which is a probe 
according to any one of claims 19 to 21; 

(c) contacting nucleic acid in said preparation with 
said probe under conditions for hybridisation; and 

(d) identifying a nucleic acid according to any one of 
claims 1 to 17 if present by its hybridisation with said 
nucleic acid probe. 
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29. A method according to claim 26 or claim 27, which method 
comprises the steps of : 

(a) providing a preparation of nucleic acid from a plant 

cell ; 

5 (b) providing a pair of nucleic acid molecule primers 

suitable for PCR, at least one of said primers being a primer 
of any one of claims 18 to 21; 

(c) contacting nucleic acid in said preparation with 
said primers under conditions for performance of PCR; 
1° (d) performing PCR and determining the presence or 

absence of an amplified PCR product. 

30. A method according to claim 29 wherein the pair of 
nucleic acid molecule primers are according to claim 22 or 

15 claim 23 . 

31. A method of selecting a plant having a desired allele of 
the VRN2 gene, which method employs a probe or primer 
according to any one of claims 18 to 21 qr a pair of primers 

20 according to claim 22 or 23. 

32. A recombinant vector which comprises the nucleic acid of 
any one of claims 1 to 17. 

25 33. A vector according to claim 32 wherein the nucleic acid 
comprised in the vector is capable of regulating one or more 
genes involved in the transition from vegetative to 
reproductive growth and/or capable of regulating one or more 
genes involved in the determination of leaf size and/or shape. 

30 

34. A vector according to claim 32-or claim 33 wherein the 
nucleic acid is operably linked to a promoter for 
transcription in a host cell, wherein the promoter is 
optionally an inducible promoter. 

35 

35. A vector according to claim 34 wherein the promoter is a 
constitutive promoter. 
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36. A vector according to any one of claims 32 to 35 which is 
a plant vector. 



10 



37. A method which comprises the step of introducing a vector 
according to any one of claims 32 to 36 into a host cell such 
as to transform the host cell. 

38. A host cell which is transformed with a heterologous 
nucleic acid of any one of claims 1 to 17. 

39. A host cell according to claim 38 which is a plant cell, 
optionally present in a plant. 

40. A method for producing a transgenic plant, which method 
15 comprises the steps of: 

(a) performing a method according to claim 37 

(b) regenerating a plant from the transformed host cell. 



20 



41. A transgenic plant which is obtainable by the method of 
claim 40, or which is a clone, or selfed or hybrid progeny or 
other descendant of said transgenic plant, which in each case 
includes the plant cell of claim 39. 

42. A plant according to claim 41 which is an agricultural or 
25 horticultural plant. 

43. A plant according to claim 41 or claim 42 selected from 
the list consisting of: rice, maize, wheat, barley, oats, rye, 
oil seed rape, sugar beet, sunflower, soybean, sorghum, 
lettuce, endive, cabbage, broccoli, cauliflower, carnation, 
geranium, tobacco, cotton, canola, tomato, mango, peach, 
apple, pear, strawberry, banana, melon, carrot, onion, pea, 
celery. 

35 44. A plant according to claim 43 which is selected from 
tobacco, oil seed rape, rice and wheat. 



30 
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45 . A part or propagule from a plant according to any one of 
claims 41 to 44, which includes the plant cell of claim 39. 

46 . An isolated polypeptide which is encoded by a nucleotide 
5 sequence according to any one of claims 1 to 17. 

47. A polypeptide according to claim 46 which comprises an 
amino acid sequence which consists of the sequence of SEQ ID 
Nos . 2 , 5 or 8 . 

10 

48. A polypeptide according to claim 46 which is a fragment 
of a polypeptide of claim 47. 

49. An isolated polypeptide which consists of the sequence of 
15 SEQ ID Nos. 2, 5 or 8. 

50 . An isolated polypeptide which is a fragment of a 
polypeptide according to claim 47 and which comprises at least 
5 contiguous amino acids of that polypeptide. 

20 

51. A fragment according to claim 50 which comprises amino 
acids 90 to 111 (ziAc finger motif) , amino acids 63 to 132 
(amino terminal region) , amino acids 263 to 366 (carboxy 
terminus) or amino acids 263 to 328 (activation domain) . 

25 

52. A fragment according to any one of claims 48 to 51 which 
is capable of affecting one or more physical characteristics 
of a plant expressing said fragment , the physical 
characteristics being selected from vernalization response, 

30 flowering time, leaf size and/or shape or shade avoidance 
response. 

53 . An isolated nucleic acid which encodes for a fragment 
according to claim 50 or claim 51, or claim 52 as dependent 

35 therefrom. 

54 . An isolated nucleic acid which is the complement of the 
nucleic acid of claim 53 . 



SUBSTITUTE SHEET (RULE 26) 



WO 00/44918 PCT/GBOO/00248 

87 

55. A method of making the polypeptide of any one of claims 
46 to 52, which method comprises the step of causing or 
allowing expression from a nucleic acid of any one of claims 1 
to 17 or 53 in a suitable host cell. 

5 

56. An antibody which has specific binding affinity for a 
polypeptide according to any one of claims 46 to 52. 

57. A polypeptide which comprises the antigen-binding site of 
10 the antibody of claim 56 . 

58. A method for affecting a physical characteristic of a 
plant selected from vernalization response, flowering time, 
leaf size and/or shape or shade avoidance response, which 

15 method comprises either the step of: 

(i) causing or allowing transcription from a nucleic 
acid according to claim 17 or claim 54 in the plant; or 

(ii) causing or allowing transcription from a nucleic 
acid according to any one of claims 1 to 16 or claim 53 . 

20 

59. A method according to claim 58 which method comprises 
causing or allowing transcription from a nucleic acid 
according to any one of claims 1 to 16 or claim 53 thereby to 
reduce VRN2 expression by co-suppression. 
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TCTATATAAGCATATTGCAGAAGAG 155 

AATTTAGGGGAGGCCTCAGAGTTTGCACTAACT^ 230 

ATGTCTAGGCAGAATTGTCGCGCGAAATCCTCACCGG 305 

MCKQKCRAKSSPEEVISTDENLLIY 25 

TGTAAACCTGTTCGACTATATAACATCTTTCACCTTC 380 

CKPVRLYNIFHLRSLGNPSFLPRCL 50 

AACTACAAAATTGGAGCAAAGCGCAAA^ 455 

N Y K I G A IK R K R k| SRSTGMVVFNYKDC 75 

AATAAC ACATTACAGAAAACTG AAOT^ 530 

NNTLQKTEVREDC 5 CPFCSMLCGSF 100 

AAG«3GCTCCAATTTCATT^ 605 

KGLQFHLNSSH DLFEFEFKLFEEYQ 125 

ACAGTTAATGTTTCTGTAAAACTTAAT 680 

TVNVSVKLNSFIFEEEGSDDDKFEP 150 

TTCTCTCTCTGCTCGAAACCT CGTAAG C 755 

F S L C S K P |R K[ RRQRGGRNNT 1 R R L K Vj C 175 



TTTTTACCGTTGGATTCACCCAGTTTAACT^ 830 

FLPLDSPSLTNGTENGITLLNDGNR 200 

GGTTTAGGATATCCCGAGGCAACAGAGCTTGCT^ 905 

GLGYPEATELAGQFEMTSNIPPAIA 225 

CACTCTTCTCTGX3ACGCTGGTGCT 980 

HSSLDAGAKVILTSEAVVPATKTRK 250 

TTATCTGCTGAGCGATCAGAGGCTAGAAGCCACCTACT^^ 1055 

LSAERSEARSHLLLQKRQFYHSHRV 275 

CAGCCAATGGCX^TTGAGCAAGTAATGTCTGACC 1130 

Q P M A L EQVMSD^nSE DEVDDDVADF 300 

4 vm2-1 

GAAGATCGCCA^TGCTTGATGAC^^ 1205 

EDRQMLDDFVDVNKDE KQFMHL(w)NS 325 

TTTGTAAGAAAACAAAGGGTTATAGCAGATGGTCATATCT 1280 

FVRKQRV I ADGHI SWACEAF S RFYE 350 

AAAGAGTTGCACCGTTACTCATCACTCTTCT^ 1355 

KELHRYSSLF -WCWRLFLIKLWNHGL 375 

GTCG ACTC AGCC ACC ATC AACAACTGC AATACC ATCCTCGAGAATTGC TCAG AC ACC ACC ACCAC C 1430 

VDSATINNCNTILENCRNSSDTTTT 400 

AACAACAACAACAGTGTGGATCGTCCCAGTGACTC 1505 

NNNNSVDRPSDSNTNNNNIVDHPND 425 

ATAAACAACAAGAACAATGTTGACAACAAGGACAATAACA^ 1580 

INNKNNVDNKDNNSRDKVIK 445 

CTTTTATGATACCGATTTATCGGATTGTAACT 1655 

TTATATGTTAGTGTATTCAACTGATTACATTTTTAGTTA 1722 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re the Application of 

Caroline Dean et al. 

Serial No. 09/890/220 

Filed: July 27, 2001 

For: "Methods and Means for 
Modification of Plant 
Characteristics Using 
The Vernalization Gene 
VRN2 " 



SUBMISSION OF SEQUENCE LISTING 
UNDER 37 C.F.R. §§1,821-1.825 AND THIRD PRELIMINARY AMENDMENT 

The present submission is in response to the Official 
communication dated August 24, 2001 enclosing a Notification 
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SEQUENCE LISTING 



<110> Dean, Caroline 

Gendal 1 , Anthony 

<120> Methods and means for modification of plant characteristics 
using the vernalisation gene VRN2 * 

<130> Mewburn 

<140> US 09/890,220 
<141> 2001-07-27 

<150> PCT/GBOO/00248 
<151> 2000-01-28 

<150> GB 9901927.5 
<151> 1999-01-28 

<160> 77 

<170> Patentln Ver. 2,1 

<210> 1 
<211> 1722 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 1 

caagcttctt caattttgct tgctctctct tacacagcca atcggtgttt tcgcagcttt 60 
caggcctcaa tccaagacat tctatataag catattgcag aagaggcggt tctaattgtt 120 
gcattgagtt tatcgctatg acgtagggaa attctaattt aggggaggcc tcagagtttg 180 
cactaacttc ataatcggct cttgacgttg ttgagtgtaa ttgaacaaga atgtgtaggc 240 
agaattgtcg cgcgaaatcc tcaccggagg aagtgatttc aactgatgag aatctcttga 300 
tatattgtaa acctgttcga ctatataaca tctttcacct tcgctctcta ggcaacccat 360 
cgtttcttcc aagatgcttg aactacaaaa ttggagcaaa gcgcaaaaga aagtcaagat 420 
ctactgggat ggtagttttc aactataagg attgtaataa cacattacag aaaactgaag 480 
ttagggagga ttgttcttgt ccattttgct ctatgctatg tggtagcttc aaggggctgc 540 
aatttcattt gaattcatct catgatttat ttgaatttga gttcaagctt ttcgaagaat 600 
accagacagt taatgtttct gtaaaactta attccttcat atttgaggaa gaaggaagtg 660 
atgacgataa atttgagccc ttctctctct gctcgaaacc tcgtaagcgg agacaaagag 720 
gtggcagaaa taacaccagg agacttaaag tatgcttttt accgttggat tcacccagtt 780 
taactaatgg cacagaaaat ggaatcaccc tacttaatga tggaaaccgt ggtttaggat 840 
atcccgaggc aacagagctt gctggacaat ttgagatgac cagcaacatt ccaccagcca 900 
tagcccactc ttctctggac gctggtgcta aagttatatt gacaagcgaa gctgtggtcc 960 
ctgctactaa gacaagaaag ttatctgctg agcgatcaga ggctagaagc cacctacttc 1020 
ttcagaaacg ccaattctat cattctcaca gagtccagcc aatggcgctt gagcaagtaa 1080 
tgtctgaccg ggatagcgag gatgaagtcg atgacgatgt tgcagatttt gaagatcgcc 1140 
agatgcttga tgactttgtg gatgtgaata aagatgaaaa gcaattcatg catctttgga 1200 
actcgtttgt aagaaaacaa agggttatag cagatggtca tatctcttgg gcatgtgaag 1260 
cattttcaag attttacgag aaagagttgc accgttactc atcactcttc tggtgttgga 1320 
gattgttttt gattaaacta tggaaccatg gacttgtcga ctcagccacc atcaacaact 1380 
gcaataccat cctcgagaat tgccgtaata gctcagacac caccaccacc aacaacaaca 1440 
acagtgtgga tcgtcccagt gactcaaaca ccaacaacaa taacattgtg gatcatccca 1500 
atgacataaa caacaagaac aatgttgaca acaaggacaa taacagcaga gacaaagtaa 1560 
ttaaatagga aaatctccgg cttttatgat accgatttat cggattgtaa cttattcttc 1620 
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tttcttaaaa aattgtttag gagcaaacaa attttttata tgttagtgta ttcaactgat 1680 
tacattttta gttaaaaaaa aaaatggatt ctgcttataa ct 1722 



<210> 2 
<211> 445 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Cys Arg Gin Asn Cys Arg Ala Lys Ser Ser Pro Glu Glu Val lie 
15 10 15 

Ser Thr Asp Glu Asn Leu Leu lie Tyr Cys Lys Pro Val Arg Leu Tyr 
20 25 30 

Asn lie Phe His Leu Arg Ser Leu Gly Asn Pro Ser Phe Leu Pro Arg 
35 40 45 

Cys Leu Asn Tyr Lys lie Gly Ala Lys Arg Lys Arg Lys Ser Arg Ser 
50 55 60 

Thr Gly Met Val Val Phe Asn Tyr Lys Asp Cys Asn Asn Thr Leu Gin 
65 70 75 80 

Lys Thr Glu Val Arg Glu Asp Cys Ser Cys Pro Phe Cys Ser Met Leu 
85 90 95 

Cys Gly Ser Phe Lys Gly Leu Gin Phe His Leu Asn Ser Ser His Asp 
100 105 110 

Leu Phe Glu Phe Glu Phe Lys Leu Phe Glu Glu Tyr Gin Thr Val Asn 
115 120 125 

Val Ser Val Lys Leu Asn Ser Phe lie Phe Glu Glu Glu Gly Ser Asp 
130 135 140 

Asp Asp Lys Phe Glu Pro Phe Ser Leu Cys Ser Lys Pro Arg Lys Arg 
145 150 155 160 

Arg Gin Arg Gly Gly Arg Asn Asn Thr Arg Arg Leu Lys Val Cys Phe 
165 170 175 

Leu Pro Leu Asp Ser Pro Ser Leu Thr Asn Gly Thr Glu Asn Gly lie 
180 185 190 

Thr Leu Leu Asn Asp Gly Asn Arg Gly Leu Gly Tyr Pro Glu Ala Thr 
195 200 205 

Glu Leu Ala Gly Gin Phe Glu Met Thr Ser Asn lie Pro Pro Ala lie 
210 215 220 

Ala His Ser Ser Leu Asp Ala Gly Ala Lys Val lie Leu Thr Ser Glu 
225 230 235 240 



Ala Val Val Pro Ala Thr Lys Thr Arg Lys Leu Ser Ala Glu Arg Ser 
245 250 255 
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Glu Ala Arg Ser His Leu Leu Leu Gin Lys Arg Gin Phe Tyr His Ser 
260 265 270 

His Arg Val Gin Pro Met Ala Leu Glu Gin Val Met Ser Asp Arg Asp 
275 280 285 

Ser Glu Asp Glu Val Asp Asp Asp Val Ala Asp Phe Glu Asp Arg Gin 
290 295 300 

Met Leu Asp Asp Phe Val Asp Val Asn Lys Asp Glu Lys Gin Phe Met 
305 310 315 320 

His Leu Trp Asn Ser Phe Val Arg Lys Gin Arg Val lie Ala Asp Gly 
325 330 335 

His lie Ser Trp Ala Cys Glu Ala Phe Ser Arg Phe Tyr Glu Lys Glu 
340 345 350 

Leu His Arg Tyr Ser Ser Leu Phe Trp Cys Trp Arg Leu Phe Leu lie 
355 360 365 

Lys Leu Trp Asn His Gly Leu Val Asp Ser Ala Thr lie Asn Asn Cys 
370 375 380 

Asn Thr lie Leu Glu Asn Cys Arg Asn Ser Ser Asp Thr Thr Thr Thr 
385 390 395 400 

Asn Asn Asn Asn Ser Val Asp Arg Pro Ser Asp Ser Asn Thr Asn Asn 
405 410 415 

Asn Asn lie Val Asp His Pro Asn Asp lie Asn Asn Lys Asn Asn Val 
420 425 430 

Asp Asn Lys Asp Asn Asn Ser Arg Asp Lys Val lie Lys 
435 440 445 



<210> 3 
<211> 6338 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

aaagagaatg ctttgactct ctcattggtc aaacctgact gtatttatat gcgttattgt 60 
gtggtaaagt ttcgaccttt gactttacaa gttggcgtta agaagagaga tgcgtagatc 120 
agcgagtggt tcgagagttt tggatcattt tcccccgact tcacggtctc cacgtcgatc 180 
tcagagcatt acatcattgg aagatgatgt ggaggtgctt ttgcctaggt acgatccgaa 240 
ttctcaagcg gggaagagag agaagtcaag attcagattt gcagaaaacg tcatccattt 300 
gattcctctc attcttcttc tctgtatcgc aatcctctgg ctctcctctt attcaggtaa 360 
gccgagaaat tgattcaatc tctatgaatc cataattgat atgtgaaact taattaggga 420 
ttttacaaag gctcatatgg atatgatatg aggatcgaga tgtctctgta acattagaat 480 
cttgtgttga attattgttt caatttgttc atattatact aaaccggtga tggatttgga 540 
atttgtcagc agcgttaagg agttgagttc aagaagcaac atgttgtctt gtctccatgg 600 
gaactcatca tattcagttt tgggaaagga aacaattttt tttaccgccg gtgattatgt 660 
gccgcaaacc atacgtaact tttgtaattt tcggttctgt agacacataa aaggatctct 720 
cgttttcatg aaatgtatgt ttaatatttc actatacatc acacaactca agtagaaaac 780 
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actgatggtt atccattaat catcattcta 
attgctacct tagtgattag atgttcctgt 
aaacaatgga atcaaaatac atattcagta 
acaaaaatgg aaattaaaat tgaaggtcat 
gatcaatgat aaaagtatat accagaacag 
gcacaatata tttaacatat acaaatttta 
cwtctgaatg aagcaaattt atttgatgtg 
ttgatgttag tatagtactc tatgctaaca 
aggtaagaaa agaaaaatag attaatataa 
tgaaagaaga taataactaa ttgactaaaa 
taagtatttc atctttacta taaaatgtaa 
ggcacgtgcc taggtaccaa tagacgtacg 
tttaattcct ctaatactca ctgtactgcc 
atattataat ggattaagct gtccaagttg 
taaacataaa taagttcaac tatttattga 
gtttaaaata tgggccataa ggcccagact 
tttttacttg tttcttcttt ctcagtgaat 
ctctcttaca cagccaatcg gtgttttcgc 
aatcggagtc aagtaataac aattgataaa 
gaaattttgc tgtagatccg gaatcgaatt 
tttttttttt aagatttctt cttacacatt 
atttgataag tatgatgatt ctgctaagtg 
cgtctaaact tgtctatgtc ttgaatgaac 
tctttattac acaggcctca atccaagaca 
ttctaattgt tgcattgagt ttatcgctat 
ctcagagttt gcactaactt cataatcggc 
aatgtgtagg cagaattgtc gcgcgaaatc 
gaatctcttg atatattgta aacctgttcg 
aggcaacgta tgatttgcct tcctctctca 
gtgtagatca cccactaata gtttgagttt 
agtgtgtgct tcttttgtct cctaaatttg 
tcttccaaga tgcttgaact acaaaattgg 
cttgaatgta gctgctacag tgatatgtta 
acctatttta tctttgttga gtagatatgg 
ctataaactt acattttata aaattgtgtt 
gttttcaact ataaggattg taataacaca 
gttcgtcgac aaaattcgat gtcaatgtct 
tttttttctg tattgtcatg cagttaggga 
atgtggtagc ttcaaggtgg gcaactatta 
tctaacactg tgaaatgcta ctgctgtttc 
atttttgtgt ttgttgtttg tcttctcact 
gagaaaacat gtcccagatg gagcttacaa 
aatttcattt gaattcatct catgatttat 
tggaaattct tgatttgcta tgcctttatt 
ctattgtagc tttcggaaga ataccagaca 
atatttgagg tcagttactt taaacttggt 
tttgtttata ttccatcctt atttgtacta 
gcccttctct ctctggtaac cctcagaacc 
tccttgcttc tcttgtcagt acatctctgt 
acctcgtaag cggagacaaa gaggtggcag 
tttaccgttg gattcaccca gtttaactaa 
tgatggtaaa atcatatctt cttctgtgcg 
aagaagatac aatggcctga ttgtttagtt 
gggtattgtt accagaactg atgtacaaaa 
aggatatccc gaggcaacag arcttgctgg 
agccatagcc cactcttctc tggacgctgg 
ggtccctgct actaagacaa gaaagttatc 



ttggtcgaaa acaaggatta gtttcaactt 840 
gagtttcagc tagccaagtc aactagagtt 900 
atttatttta aactctgact atttatgtaa 960 
gaagattcta ttcttagtat gaaaagtata 1020 
tggtggatct agaaacatat ttagtatatg 1080 
atctaaaagt tgtattcatt tatgaaaaga 1140 
ttaatcatcc atttatgtgt taatcagcca 1200 
taattttttt atactataaa ttaaaaaaat 1260 
aaagcatttt attagctgaa ataaataaaa 1320 
aattagtaga gcatatgggg cacaatacac 1380 
caaatttcaa aattatcaaa ctgtatatag 1440 
tccgccctga aataagttgg tgaatatggt 1500 
atggtagagg tgaaaaaaac aattttagaa 1560 
gtcgtatttt ctttacattt tattaactaa 1620 
ctagtaataa tacgtgtaaa atgtctattg 1680 
tgaaaaaaaa acttgaaacc caaagttata 1740 
atctcccaat caagcttctt cgattttgct 1800 
agctttcagg tttgtctcaa tctcaaatta 1860 
cctaattgtt tccattgtat tgtaagattt 1920 
ctagttctgg aatcgttgat ctcgatggaa 1980 
tggttcaaaa gatcacatag ttttatttta 2040 
gcattggata aagttttcgt ttttgcaata 2100 
tctctgagtt gcttaaaaag tcttgtgctt 2160 
ttctatataa gcatattgca gaagaggcgg 2220 
gacgtaggga aattctaatt taggggaggc 2280 
tcttgacgtt gttgagtgta attgaacaag 2340 
ctcaccggag gaagtgattt caactgatga 2400 
actatataac atctttcacc ttcgctctct 2460 
tcatattagc tcagtaatct ttcatctcct 2520 
gctaagctga ttatggtctg attcatggcg 2580 
aacttgttgt ttgttgttgc agccatcgtt 2640 
agcaaagcgc aaaagaaagt atgttttctt 2700 
tttatcttac ttctaatatg gaagctgatg 2760 
acataatgaa tggtttcttc tttgttcatg 2820 
ttggttaggt caagatctac tgggatggta 2880 
ttacagaaaa ctgaaggtta gtctttttct 2940 
atgtttctct agatgatttg ttatttacta 3000 
ggattgttct tgtccatttt gctctatgct 3060 
caactgaggt ttcttccggg gcctttcata 3120 
atgctgtata ctttcactgt ttggttacat 3180 
cttttcgaac tgctgagtgt gtgcttatct 3240 
ccaattgtct tgtgtctatg caggggctgc 3300 
ttgaatttga gttcaaggta tgtggtttta 3360 
aatgaggtta tagttaaaaa agggtctttc 3420 
gttaatgttt ctgtaaaact taattccttc 3480 
taattgggaa atccgatagc tggtgaaaat 3540 
ggaagaagga agtgatgacg ataaatttga 3600 
ccttcgatta aataccttaa tagcagtaac 3660 
aaatccaacc ataatgtttt gcagctcgaa 3720 
aaataacacc aggagactta aagtatgctt 3780 
tggcacagaa aatggaatca ccctacttaa 3840 
ttccttgtgg cttagaactt catattacag 3900 
tttgtacttc tcctcgcatt cttcttgcga 3960 
ttaatggcat gctacaggaa accgtggttt 4020 
acaatttgag atgaccagca acattccacc 4080 
tgctaaagtt atattgacaa gcgaagctgt 4140 
tgctgagcga tcagaggcta gaaggtttgt 4200 
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tcatcatgac accccgtcat cataattacc 
atggataagt gtttatagta ctgccatatt 
tcttcagaaa cgccaattct atcattctca 
tacttcttag gcattttctt taaattgctc 
ttgttctcat ccaaatttgt attttgatct 
tgttgtctcc gttttattgc cactaaccaa 
cgcttgagca agtaatgtct gaccgggata 
attttgaaga tcgccaggta ttccatgatt 
aaatggtata tgatgtaact tgctaatggc 
gatgactttg tggatgtgaa taaagatgaa 
gtaagaaaac aaaggtaact acttctctta 
cttacattcc atacctgtct aaatgatttt 
gttgatgttt cagggttata gcagatggtc 
gattttacga gaaagagttg caccgttact 
aacatataca gacacataac tacactatca 
aaatttccag gtgttggaga ttgtttttga 
cagccaccat caacaactgc aataccatcc 
ccaccaccaa caacaacaac agtgtggatc 
acattgtggr tcatcccaat gacataaaca 
acagcagaga caaagtaatt aaataggaaa 
gattgtaact tattcttctt tcttaaaaaa 
ttagtgtatt caactgatta catttttagt 
aaaaactgaa aaaaaagaaa agtttcctta 
tctagtaaat atgagttata tattaatcaa 
tgcaaataga ttgaaaacaa atcaagaaga 
aaccccattt ggacttgttc tttctcaatg 
tttgctttga caccaccaat cggtgttttc 
tagatcggag tcaagtaata aaattgatta 
tgggatttag cagtagatcg gaaatcggaa 
aggtaatgga acatagttct ggattcattg 
cgcaatgcac atttttgttt caaaggatca 
catcaatttc tcatattagg gtttatattt 
gtatactctg tgctatgtag ataaagtctt 
aactagtttg ggctctctgt tttaaagttt 
aagactacag tctcaagaag cataatatcg 
ttgaagctta aggaggttct tatgtgcgtt 



attcctgttg ttacaaatgt tcttcctatt 4260 
aaccgagaaa atttcttcca gccacctact 4320 
cagagtccag gtgatccaag ttccttcacc 4380 
atgatgatat cttatcaaag catacttggt 4440 
gtatgtatca acgcaaaata gttatgtcca 4500 
aaaatgcatg tttctgtgac aagccaatgg 4560 
gcgaggatga agtcgatgac gatgttgcag 4620 
tctttctgcg ttcattaaat agacaacaga 4680 
ttttgaaact taaaaaagct gcagatgctt 4740 
aagcaattca tgcatctttg gaactcgttt 4800 
cacatgaaca gacacaaaaa gaccttatgt 4860 
gcttatggaa ctttgagctc aattatgatt 4920 
atatttcttg ggcatgtgaa gcattttcaa 4980 
catcactctt ctggtaatat aagtacacca 5040 
atcttgtttc gttttctgaa aaaaaaataa 5100 
ttaaactatg gaaccatgga cttgtcgact 5160 
tcgagaattg ccgtaatagc tcagacacca 5220 
gtcccagtga ctcaaacacc aacaacaata 5280 
acaagaacaa tgttgacaac aaggacaata 5340 
atctccggct tttatgatac cgatttatcg 5400 
ttgtttagga gcaaacaaat tttttatatg 5460 
taaaaaaaaa aatggattct gcttataact 5520 
atttttcttt ttgacttgag aaaaagctcc 5580 
gtacataaca taaaaatagt atatattaag 5640 
aattaattaa gacagagtga ttaagcttaa 5700 
aatccctcac aagcagcaag cttcttcgat 5760 
gaatctttca ggtttgtctc gatttcaatc 5820 
acctaagtat tcccgttctc tcgtaagagt 5880 
tttacgtttt tgttaaaaga ttgatggttt 5940 
cttctagttg attctcgaat tgtttgattt 6000 
cagaatttga tttaaaattt gacaaaattc 6060 
cttctagtaa ctcgaacttg ttggaactct 6120 
aacattttgg tcaactttgt ttgatctcta 6180 
tgtgctttca ctattacaca ggtctcatac 6240 
tcgactctgt tttgagtttc tcaacagtgg 6300 
ttgatatc 6338 



<210> 4 
<211> 1715 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 4 

caagcttctt caattttgct tgctctctct cttacacggc caatcggtgt tttcgcagct 60 
ttcaggcctc aatacaagac attctatata agcatattgc agaagaggcg gttctaattg 120 
ttgcatggag ttgaacaata tgacgtaggg aaattctaat ttaggggagg cctcagagtt 180 
tgcactaact tcataatcag ctctggacgt tgttgattgt atttgaacaa gaatgtgtag 240 
gcagaattgt cgcgcgaaat cctcaccgga ggaagtgatt tcaactgatg agaatctctt 300 
gatatattgt aaacctgttc gactatataa catctttcac cttcgctctc taggcaaccc 360 
atcgtttctg ccaagatgct tgaactacaa aattggggca aagcgcaaaa gaaagtcaag 420 
atctactggg atggtagttt tcaactataa ggattgtaat aatacattac aaagaactga 480 
agttagggag gattgttctt gtccattttg ctctatgcta tgtggtagct tcaaggggct 540 
gcaatttcat ttgaattcat ctcatgattt atttgaattt gagttcaagc ttttggaaga 600 
ataccagaca gttaatgttt ctgtaaaact taattccttc atatttgagg aagaaggaag 660 
tgatgatgat aaatttgagc ccttctctct ctgctcgaaa cctcgtaagc gtagacaaag 720 
aggtggcaga aataacacca ggagacttaa agtatgcttt ttaccgttgg attcacccag 780 
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tttagctaat ggcacagaaa atggaattgc 
atatcccgag gcaacagagc ttgctggaca 
catagcccac tcttctctgg acgctggtgc 
ccctgctact aagacaagaa agttatctgc 
tcttcagaaa cgccaattct atcattctca 
aatgtctgat cgggatagcg aggatgaagt 
ccagatgctt gatgactttg tggatgtgaa 
gaactcgttt gtaagaaaac aaagggttat 
agtattttca agattttacg agaaagagtt 
gagattgttt ttgattaaac tatggaacca 
ctgcaatacc atcctcgaga attgccgtaa 
tgtggatcat cccagtgact caaacaccaa 
cataaaaaac aagaacaatg ttgacaacaa 
taggaaacac tccggtttag atgataccga 
taaaaaaatt gtttaggagc aaacaaagat 
tttttagtta aaaaaatgga ttctccttaa 



cctgctgaat gatggaaacc gtggtttagg 840 
atttgagatg actagcaaca ttccaccagc 900 
taaagttata ttaacaaccg aagctgtggt 960 
tgagcgatca gaggctagaa gccacctact 1020 
cagagtccag ccaatggcgc ttgagcaagt 1080 
cgatgacgat gttgcagatt ttgaagatcg 1140 
taaagatgaa aagcaattca tgcatctttg 1200 
agcagatggt catatctctt gggcatgtga 1260 
gcactgttac tcatcactct tctggtgttg 1320 
tggacttgtc gactcagcca ccatcaacaa 1380 
tacctcagtc actaacaaca acaacaacag 1440 
caacaataac attgtggatc atccgaatga 1500 
ggacaataac agcagagaca agtaattaaa 1560 
tctatcggat tgtaacttat tcttctttct 1620 
tttatttgtt agtgtattca actgattaca 1680 
taact I 715 



<210> 5 
<211> 440 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 5 

Met Cys Arg Gin Asn Cys Arg Ala Lys Ser Ser Pro Glu Glu Val He 
15 10 15 

Ser Thr Asp Glu Asn Leu Leu He Tyr Cys Lys Pro Val Arg Leu Tyr 
20 25 30 

Asn lie Phe His Leu Arg Ser Leu Gly Asn Pro Ser Phe Leu Pro Arg 
35 40 45 

Cys Leu Asn Tyr Lys He Gly Ala Lys Arg Lys Arg Lys Ser Arg Ser 
50 55 60 

Thr Gly Met Val Val Phe Asn Tyr Lys Asp Cys Asn Asn Thr Leu Gin 
65 70 75 80 

Arg Thr Glu Val Arg Glu Asp Cys Ser Cys Pro Phe Cys Ser Met Leu 
85 90 95 

Cys Gly Ser Phe Lys Gly Leu Gin Phe His Leu Asn Ser Ser His Asp 
100 105 HO 

Leu Phe Glu Phe Glu Phe Lys Leu Leu Glu Glu Tyr Gin Thr Val Asn 
115 120 125 

Val Ser Val Lys Leu Asn Ser Phe He Phe Glu Glu Glu Gly Ser Asp 
130 135 140 

Asp Asp Lys Phe Glu Pro Phe Ser Leu Cys Ser Lys Pro Arg Lys Arg 
145 150 155 160 

Arg Gin Arg Gly Gly Arg Asn Asn Thr Arg Arg Leu Lys Val Cys Phe 
165 170 175 
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Leu Pro Leu Asp Ser Pro Ser Leu Ala Asn Gly Thr Glu Asn Gly He 
180 185 190 

Ala Leu Leu Asn Asp Gly Asn Arg Gly Leu Gly Tyr Pro Glu Ala Thr 
195 200 205 

Glu Leu Ala Gly Gin Phe Glu Met Thr Ser Asn He Pro Pro Ala He 
210 215 220 

Ala His Ser Ser Leu Asp Ala Gly Ala Lys Val He Leu Thr Thr Glu 
225 230 235 240 

Ala Val Val Pro Ala Thr Lys Thr Arg Lys Leu Ser Ala Glu Arg Ser 
245 250 255 

Glu Ala Arg Ser His Leu Leu Leu Gin Lys Arg Gin Phe Tyr His Ser 
260 265 270 

His Arg Val Gin Pro Met Ala Leu Glu Gin Val Met Ser Asp Arg Asp 
275 280 285 

Ser Glu Asp Glu Val Asp Asp Asp Val Ala Asp Phe Glu Asp Arg Gin 
290 295 300 

Met Leu Asp Asp Phe Val Asp Val Asn Lys Asp Glu Lys Gin Phe Met 
305 310 315 320 

His Leu Trp Asn Ser Phe Val Arg Lys Gin Arg Val He Ala Asp Gly 
325 330 335 

His He Ser Trp Ala Cys Glu Val Phe Ser Arg Phe Tyr Glu Lys Glu 
340 345 350 

Leu His Cys Tyr Ser Ser Leu Phe Trp Cys Trp Arg Leu Phe Leu He 
355 360 365 

Lys Leu Trp Asn His Gly Leu Val Asp Ser Ala Thr He Asn Asn Cys 
370 375 380 

Asn Thr He Leu Glu Asn Cys Arg Asn Thr Ser Val Thr Asn Asn Asn 
385 390 395 400 

Asn Asn Ser Val Asp His Pro Ser Asp Ser Asn Thr Asn Asn Asn Asn 
405 410 415 

He Val Asp His Pro Asn Asp He Lys Asn Lys Asn Asn Val Asp Asn 
420 425 430 



Lys Asp Asn Asn Ser Arg Asp Lys 
435 440 



8 



<210> 6 
<211> 5895 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 6 

aaagagaaga gctttgactc tctcattggt 
tgtggtaaag tttcgacctt tgacttgaca 
cagcgagtgg ttctagagtt ttggatcatt 
ctcagagtgt tacagcaatg gaagatgatg 
attctcaagc ggggaagaga gagaaatcaa 
tgattcctct cattcttctt ctctgtgtcg 
agccgagaaa ttgtttcaat ctctatgaat 
attttacaaa gactcatatg gatatgagga 
tgttgaatta tggtttcaat ttgttcatat 
gtcagcagcg ttaaggagtt gagttccaaa 
tcatattcag ttttgggaaa ggaaacaatt 
ccattcgtat ttgtaatttt tggttctgta 
aatgtatgtt taatatttca gtgatataca 
ttatccatta atcattctat tggtcgaaaa 
ggattatatg ttcctgtgag tttcagctag 
aaaatacata ttcagtaatt tattttaaac 
atcaaaattg aaggtcatga agattctatt 
ttaatttctt catcctccac gcatagatca 
ctcactgtac tgccatggta gagttaaaaa 
cattaagctg tccaagttgc ttgtattttc 
aactatttat tgactaataa taatacgtgt 
taggcccaga cttgaagaaa aacttgaaac 
tcagtgaata tctcccaatc aagcttcttc 
aatcggtgtt ttcgcagctt tcaggtttgt 
ataacaattg ataaccctaa ttgtttcaat 
atccggaatc gtattctagt tctggaatcg 
cttcatacac atttggttca aaagatcaca 
attctgctaa gtggcattgg ataaagtttt 
gtcttgaatg aactctctga gttgcttaaa 
tcaatacaag acattctata taagcatatt 
agttgaacaa tatgacgtag ggaaattcta 
cttcataatc agctctggac gttgttgatt 
gtcgcgcgaa atcctcaccg gaggaagtga 
gtaaacctgt tcgactatat aacatctttc 
gccttcctct ctcatcattt tagcttagta 
aatagtttga gtttgctaag ctgattatgg 
gtctcctaat gttatttgaa cttgttgttt 
cttgaactac aaaattgggg caaagcgcaa 
ttgccacagt gatatgttat ttatcttact 
tttgttgagt agatatggac ataatgaatg 
attttacaaa attgtgtttt gcttaggtca 
aaggattgta ataatacatt acaaagaact 
aattcgatgt caatgtctat gtttctctag 
ttgtcacgca gttagggagg attgttcttg 
caaggtgggc aactattaca actgaggttt 
aaatgctact gccgtttaat gctatatact 
gttgtttgtc ttcttgctct ttttaaactg 
tccagttcga gcttacaatc cattgtcttg 
attcatctca tgatttattt gaatttgagt 
ttttgcctat gccgttagta atgaggttat 
tttggaagaa taccagacag ttaatgtttc 



caaacctgac tgtatttata tgcgttattg 60 
agttgccgtt aagaagagag atgcgtagat 120 
ttccggcgac ttcaaggtct ccgcctcgat 180 
tggagctgct tttgcctagg tacgatccga 240 
gattcagatt ttcagaaaac gtcatccatt 300 
caatcctctg gctctcctct tactcaggta 360 
ccataattga tctgtgaaac ttaattaggg 420 
tcgagatgtc tctgcaacgt tagaatcttg 480 
aatactaaat cggtgatgga tttggaattt 540 
agcaacatgt tgtcttgtct ccatgggaac 600 
cttttaccgc cggtgatttt gtgccgcaaa 660 
gacacacaaa aggatctctc gttttcatga 720 
tcacacaact caagtagaaa acactgatgg 780 
aaagattagt ttcaacttaa tgccacctta 840 
ccaactcaac tagagttaaa caatggaatc 900 
tctgactatt tatgtaaaac acaaatggaa 960 
cttagtgtga aaagtataga tcaatgattc 1020 
atggtgaata tggttttaaa tcctctaata 1080 
aacaatttta gaaatattag tggattaagg 1140 
ttttcatttt attaattaaa aaaaaagttc 1200 
taaatggtta tcggtttaaa atatgggcca 1260 
ccaaagtttt atttttactt gttttctttc 1320 
aattttgctt gctctctctc ttacacggcc 1380 
ctcaatctca aattaaatcg gagtcaagta 1440 
tatattgtaa gatttgaaat tttgcagtag 1500 
ttgatctcga tggaattttt tttaagattt 1560 
taattttatt ttaatttgat aagtatgatg 1620 
catttttgca atacgtctaa acttgtctat 1680 
aagtcttgtg ctttctttat tacacaggcc 1740 
gcagaagagg cggttctaat tgttgcatgg 1800 
atttagggga ggcctcagag tttgcactaa 1860 
gtatttgaac aagaatgtgt aggcagaatt 1920 
tttcaactga tgagaatctc ttgatatatt 1980 
accttcgctc tctaggcaac gtatgatttg 2040 
atctttcatc tcctgtgtag atcacccact 2100 
tctgactcat ggcgagtgtg tgcttctttt 2160 
gttgttgcag ccatcgtttc tgccaagatg 2220 
aagaaagtat gcgtttcttc ttgaatgtag 2280 
tctaatatgg aagctgatga actatttatc 2340 
gtttcttctt tgttcatgct atacacttat 2400 
agatctactg ggatggtagt tttcaactat 2460 
gaaggttagt ctttttctgt tcttcgacaa 2520 
atgatttgtt atttactatt tttttctgta 2580 
tccattttgc tctatgctat gtggtagctt 2 640 
cttccggggc ctttcatatc taacactgtg 2700 
ttcactgttt ggttacatat ttttgtgttt 2760 
ctgagtgtgt gcttatctga gaaaacatgt 2820 
tgtctatgca ggggctgcaa tttcatttga 2880 
tcaaggtatg tggttttatg gaatttcttg 2940 
agttaaaaaa gggtctttcc tattgtagct 3000 
tgtaaaactt aattccttca tatttgaggt 3060 
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cagttacttt aaacttggtt aattgggaaa tcctatagct ggtgaaaatt tggtttatat 3120 
tccatcctta tttgtactag gaagaaggaa gtgatgatga taaatttgag cccttctctc 3180 
tctggtaact ctcagaaccc cttgattaaa taccttaata gcagtaactc cttgcttttc 3240 
ttgtcagtac ttctctataa atccaaccac aatgttttgc agctcgaaac ctcgtaagcg 3300 
tagacaaaga ggtggcagaa ataacaccag gagacttaaa gtatgctttt taccgttgga 3360 
ttcacccagt ttagctaatg gcacagaaaa tggaattgcc ctgctgaatg atggtaaaat 3420 
cacatcttct tctgtggtat tcgttgtggc ttagaacttc attttacaga agaagataca 3480 
atgtcctgat tgtttagttt ttgtacttct cctcgcattc ttcttgtgag ggtaatgtta 3540 
ccagaactga tgtacaaaat taatggcatg ctacaggaaa ccgtggttta ggatatcccg 3600 
aggcaacaga gcttgctgga caatttgaga tgactagcaa cattccacca gccatagccc 3660 
actcttctct ggacgctggt gctaaagtta tattaacaac cgaagctgtg gtccctgcta 3720 
ctaagacaag aaagttatct gctgagcgat cagaggctag aaggtttgtt catcatgaca 3780 
ccccgtcatc ataattacca tacctgttgt tacaaatgtt cttcctatta tggataagtg 3840 
tttactgtac tgccatatta accgagaaaa tttcttccag ccacctactt cttcagaaac 3900 
gccaattcta tcattctcac agagtccagg tgatccaagt tccttcacct acttcttagg 3960 
cattttcttt aaattgctca tgatgatatc ttatcaaagc atacttggtt tgttctcatc 4020 
taaatttgta ttttgattct gtatgtatca acgcaaaaaa attatgtcca tgttgtctcc 4080 
gttttattgc cactaaccaa aaactgcatg tttcttgtga caagccaatg gcgcttgagc 4140 
aagtaatgtc tgatcgggat agcgaggatg aagtcgatga cgatgttgca gattttgaag 4200 
atcgccaggt attccatgat ttctttctgc gttcattaag taggcaacag aaaatggtat 4260 
acgatgtaac ttgctaatgg cttttgaaac ttaaaaaagc tgcagatgct tgatgacttt 4320 
gtggatgtga ataaagatga aaagcaattc atgcatcttt ggaactcgtt tgtaagaaaa 4380 
caaaggtaac tacttctctt acacttgaac acacacaaaa agaccttatg tcttacattc 4440 
catacctgtc taaatgattc tgcttatgga actttgagct caaattatga ttgatgtttg 4500 
cagggttata gcagatggtc atatctcttg ggcatgtgaa gtattttcaa gattttacga 4560 
gaaagagttg cactgttact catcactctt ctggtaatat aagtacacca aacatataca 4620 
gacacataac tacactatca attttgtttc gtttttctga aagaaaaata aaaaattcca 4680 
ggtgttggag attgtttttg attaaactat ggaaccatgg acttgtcgac tcagccacca 4740 
tcaacaactg caataccatc ctcgagaatt gccgtaatac ctcagtcact aacaacaaca 4800 
acaacagtgt ggatcatccc agtgactcaa acaccaacaa caataacatt gtggatcatc 4860 
cgaatgacat aaaaaacaag aacaatgttg acaacaagga caataacagc agagacaagt 4920 
aattaaatag gaaacactcc ggtttagatg ataccgatct atcggattgt aacttattct 4980 
tctttcttaa aaaaattgtt taggagcaaa caaagatttt atttgttagt gtattcaact 5040 
gattacattt ttagttaaaa aaatggattc tccttaataa ctaaagactg aaaaataaga 5100 
taagtttcct taatttttct ttttgacttg agaaaaagct cctctagacc tctagtaaat 5160 
aggagttata tattaatcaa gtacataaca taaaaatata tatattaagt gcaaatagat 5220 
tgaaaacaaa tcaagaaatt aattaagaca cagtgattaa gcttaaaacc ccattttgac 5280 
ttgttctttc tcaatgaatc cctcacaagc agcaagcttc ttcgattttg ctttgacacc 5340 
accaatcagt gttttcgaat ctttcaggtt tgtctcgatt tcaaactaga tcggagtcaa 5400 
gtgataaaat tgactaacat aattattccc gttctctcgt aagagttggg atttagcagt 5460 
agatcggaaa tcggaattta cgtttttgtt aaaagattga tggtttaggt aataaaacat 5520 
agttctggat tcattgcttc tagttgattc tcgaattgtt tgatttcgca atgcacattt 5580 
ttggttcaaa ggatcacata atttgcttta aaatttgaca aaacatacca tcaaatttct 5640 
catatttctt caagtaactc gaacttgttg gaaatctata tactctgggc tatgtagata 5700 
aagtcttaac attttggtca acattgtttg ttctctaaac tagtttgggt tctctgtttt 5760 
aaagtttggt gctttcacta ttacacaggt cttatacaag actacagtct ctagaagcat 5820 
aatatcgtcg actctgtttt gagtttccca acagtggttg aagcttaagg aggttcttat 5880 
gtgccttttg aaatc 5895 



<210> 7 
<211> 1737 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 7 

caagcttctt caattttgct tgctctctct cttacacggc caatcggtgt tttcgcagct 
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ttcaggcctc aatacaagac attctatata 
ttgcatggag ttgaacaata tgacgtaggg 
tgcactaact tcataatcag ctctggacgt 
gcagaattgt cgcgcgaaat cctcaccgga 
gatatattgt aaacctgttc gactatataa 
atcgtttctg ccaagatgct tgaactacaa 
atctactggg atggtagttt tcaactataa 
agttagggag gattgttctt gtccattttg 
caactattac aactgagggg ctgcaatttc 
ttgagttcaa gcttttggaa gaataccaga 
tcatatttga ggaagaagga agtgatgatg 
aacctcgtaa gcgtagacaa agaggtggca 
ttttaccgtt ggattcaccc agtttagcta 
atgatggaaa ccgtggttta ggatatcccg 
tgactagcaa cattccacca gccatagccc 
tattaacaac cgaagctgtg gtccctgcta 
cagaggctag aagccaccta cttcttcaga 
agccaatggc gcttgagcaa gtaatgtctg 
atgttgcaga ttttgaagat cgccagatgc 
aaaagcaatt catgcatctt tggaactcgt 
gtcatatctc ttgggcatgt gaagtatttt 
actcatcact cttctggtgt tggagattgt 
tcgactcagc caccatcaac aactgcaata 
tcactaacaa caacaacaac agtgtggatc 
acattgtgga tcatccgaat gacataaaaa 
acagcagaga caagtaatta aataggaaac 
attgtaactt attcttcttt cttaaaaaaa 
ttagtgtatt caactgatta catttttagt 



agcatattgc agaagaggcg gttctaattg 120 
aaattctaat ttaggggagg cctcagagtt 180 
tgttgattgt atttgaacaa gaatgtgtag 240 
ggaagtgatt tcaactgatg agaatctctt 300 
catctttcac cttcgctctc taggcaaccc 360 
aattggggca aagcgcaaaa gaaagtcaag 420 
ggattgtaat aatacattac aaagaactga 480 
ctctatgcta tgtggtagct tcaaggtggg 540 
atttgaattc atctcatgat ttatttgaat 600 
cagttaatgt ttctgtaaaa cttaattcct 660 
ataaatttga gcccttctct ctctgctcga 720 
gaaataacac caggagactt aaagtatgct 780 
atggcacaga aaatggaatt gccctgctga 840 
aggcaacaga gcttgctgga caatttgaga 900 
actcttctct ggacgctggt gctaaagtta 960 
ctaagacaag aaagttatct gctgagcgat 1020 
aacgccaatt ctatcattct cacagagtcc 1080 
atcgggatag cgaggatgaa gtcgatgacg 1140 
ttgatgactt tgtggatgtg aataaagatg 1200 
ttgtaagaaa acaaagggtt atagcagatg 1260 
caagatttta cgagaaagag ttgcactgtt 1320 
ttttgattaa actatggaac catggacttg 1380 
ccatcctcga gaattgccgt aatacctcag 1440 
atcccagtga ctcaaacacc aacaacaata 1500 
acaagaacaa tgttgacaac aaggacaata 1560 
actccggttt agatgatacc gatctatcgg 1620 
ttgtttagga gcaaacaaag attttatttg 1680 
taaaaaaatg gattctcctt aataact 1737 



<210> 8 
<211> 107 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Cys Arg Gin Asn Cys Arg Ala Lys Ser Ser Pro Glu Glu Val lie 
15 10 15 

Ser Thr Asp Glu Asn Leu Leu lie Tyr Cys Lys Pro Val Arg Leu Tyr 
20 25 30 

Asn lie Phe His Leu Arg Ser Leu Gly Asn Pro Ser Phe Leu Pro Arg 
35 40 45 

Cys Leu Asn Tyr Lys lie Gly Ala Lys Arg Lys Arg Lys Ser Arg Ser 
50 55 60 

Thr Gly Met Val Val Phe Asn Tyr Lys Asp Cys Asn Asn Thr Leu Gin 
65 70 75 80 

Arg Thr Glu Val Arg Glu Asp Cys Ser Cys Pro Phe Cys Ser Met Leu 
85 90 95 



Cys Gly Ser Phe Lys Val Gly Asn Tyr Tyr Asn 
100 105 



11 



<210> 9 
<211> 463 
<212> DNA 

<213> Oryza sativa 
<400> 9 

acattttcgt accgctcaag atttaagaag cgtaaaaggg tggaaatctc aagtgataaa 60 
attaggcatg tacatccaca tattgtggat tcaggatcac ctgaagatgc ccaggcagga 120 
tctgaagacg attacgtgca gagggaaaat ggtagttctg tagcacacgc ttctgttgat 180 
cctgctaatt cattacacgg tagcaatctt tcagcaccaa cagtgttaca gtttgggaag 240 
acaagaaagc tgtctgttga acgagctgat cccagaaatc ggcagctcct acaaaaacgc 300 
cagttctttc attctcacag ggctcaacca atggcattgg gagcagtttt ctcagatcgt 360 
gatagtgaag atgaggttga tgatgacatt gctgattttg aagatagaca gatgcttgat 420 
gattttgttg atgttaccaa agacgaactt attatgcata tgg 463 



<210> 10 
<211> 154 
<212> PRT 

<213> Oryza sativa 
<400> 10 

Thr Phe Ser Tyr Arg Ser Arg Phe Lys Lys Arg Lys Arg Val Glu lie 
15 10 15 

Ser Ser Asp Lys lie Arg His Val His Pro His lie Val Asp Ser Gly 
20 25 30 

Ser Pro Glu Asp Ala Gin Ala Gly Ser Glu Asp Asp Tyr Val Gin Arg 
35 40 45 

Glu Asn Gly Ser Ser Val Ala His Ala Ser Val Asp Pro Ala Asn Ser 
50 55 60 

Leu His Gly Ser Asn Leu Ser Ala Pro Thr Val Leu Gin Phe Gly Lys 
65 70 75 80 

Thr Arg Lys Leu Ser Val Glu Arg Ala Asp Pro Arg Asn Arg Gin Leu 
85 90 95 

Leu Gin Lys Arg Gin Phe Phe His Ser His Arg Ala Gin Pro Met Ala 
100 105 110 

Leu Gly Ala Val Phe Ser Asp Arg Asp Ser Glu Asp Glu Val Asp Asp 
115 120 125 

Asp lie Ala Asp Phe Glu Asp Arg Gin Met Leu Asp Asp Phe Val Asp 
130 135 140 



Val Thr Lys Asp Glu Leu lie Met His Met 
145 150 
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<210> 11 
<211> 325 
<212> DNA 

<213> Populus tremula x Populus tremuloides 
<400> 11 

acatgcatat cctgatgctg aatgtgctca attggtacct gggaataatc ttgcacctcc 60 
tgccatgcta caatttgcaa agacaagaaa attatcaatt gaacggtctg acatgagaaa 120 
ccgtacactc cttcacaaac gacaattttt tcactcacat agagctcagc caatggcagc 180 
tgagcaagtt atgtcagatc gggatagtga ggatgaagtt gacgatgatg ttgcagattt 240 
tgaagaccga aggatgcttg atgattttgt agacgtgact aaagatgaga agcaaatgat 300 
gcacttgtgg aactcatttg tgagg 325 



<210> 12 
<211> 108 
<212> PRT 

<213> Populus tremula x Populus tremuloides 
<400> 12 

His Ala Tyr Pro Asp Ala Glu Cys Ala Gin Leu Val Pro Gly Asn Asn 
15 10 15 

Leu Ala Pro Pro Ala Met Leu Gin Phe Ala Lys Thr Arg Lys Leu Ser 
20 25 30 

lie Glu Arg Ser Asp Met Arg Asn Arg Thr Leu Leu His Lys Arg Gin 
35 40 45 

Phe Phe His Ser His Arg Ala Gin Pro Met Ala Ala Glu Gin Val Met 
50 55 60 

Ser Asp Arg Asp Ser Glu Asp Glu Val Asp Asp Asp Val Ala Asp Phe 
65 70 75 80 

Glu Asp Arg Arg Met Leu Asp Asp Phe Val Asp Val Thr Lys Asp Glu 
85 90 95 

Lys Gin Met Met His Leu Trp Asn Ser Phe Val Arg 
100 105 



<210> 13 
<211> 558 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 13 

atggatccga 

agccactact 

tcacttgagc 

gctgcacatc 

gcagagagat 

gctcatattc 

aacttatcac 

cttgacagag 



ttaagctgac 
tgcctcttca 
aagttatgtc 
tcgaagaaag 
tcataaaact 
cttgggcatg 
tcgacttgtg 
tcaccatgaa 



aacagaagct 
taaacgccag 
tgaccgagat 
ccagatgctt 
ttggaactcc 
tgaagcattc 
ctggagacaa 
caaatgcaat 



aaggtccctg 
ttctatcatt 
agcgaaaatg 
aatggttcca 
tttgttaaac 
tcaagattac 
ttcatgatca 
accatcatct 



ctaagcgatc 
cccgaaccgg 
acgtcgacaa 
tggatgagaa 
agcaaaggat 
acctgcaaga 
aacaatggga 
accataatat 



aaaggctaca 
tcagccattg 
aaatgatgat 
tgaaatcgta 
tgttgcagat 
gctgcgcagt 
ttatggactt 
ctcaactacc 



60 

120 

180 

240 

300 

360 

420 

480 
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aacqatgaca taaacaataa caacacaagg acgactgata atatggatgt tgtcgacgat 540 

C C Q 

gacataaaca gagacaag 



<210> 14 
<211> 186 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

Met Asp Pro He Lys Leu Thr Thr Glu Ala Lys Val Pro Ala Lys Arg 
15 10 15 

Ser Lys Ala Thr Ser His Tyr Leu Pro Leu His Lys Arg Gin Phe Tyr 
20 25 30 

His Ser Arg Thr Gly Gin Pro Leu Ser Leu Glu Gin Val Met Ser Asp 
35 40 45 

Arg Asp Ser Glu Asn Asp Val Asp Lys Asn Asp Asp Ala Ala His Leu 
50 55 60 

Glu Glu Ser Gin Met Leu Asn Gly Ser Met Asp Glu Asn Glu He Val 
65 70 75 80 

Ala Glu Arg Phe He Lys Leu Trp Asn Ser Phe Val Lys Gin Gin Arg 
85 90 95 

He Val Ala Asp Ala His He Pro Trp Ala Cys Glu Ala Phe Ser Arg 
100 105 HO 

Leu His Leu Gin Glu Leu Arg Ser Asn Leu Ser Leu Asp Leu Cys Trp 
115 120 125 

Arg Gin Phe Met He Lys Gin Trp Asp Tyr Gly Leu Leu Asp Arg Val 
130 135 140 

Thr Met Asn Lys Cys Asn Thr He He Tyr His Asn He Ser Thr Thr 
145 150 155 160 

Asn Asp Asp He Asn Asn Asn Asn Thr Arg Thr Thr Asp Asn Met Asp 
165 170 175 

Val Val Asp Asp Asp He Asn Arg Asp Lys 
180 185 



<210> 15 

<211> 4441 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ctctgaggag acactttttt tttcctccct ccttccctcc tctcctcctc ccttcccttc 60 
ccctctcctc ccctctctcc tccttccccc ctcggtccgc cggagcctgc tggggcgagc 120 
ggttggtatt gcaggcgctt gctctccggg gccgcccggc gggtagctgg cggggggagg 180 
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aggcaggaac cgcgatggcg cctcagaagc 
ccagcgcggg gtccggggga ggcggcttcg 
cttcgggcgg caaatccggc ggcgggagct 
cctcctcctc cgcggcggca gcggcggggg 
tggagcacgt ccaggctgac cacgagcttt 
tctatagatt tcttcgaact cggaatctca 
cttacatgtc tcatcgaaac tccagaacaa 
atatgttatc aaaagtagag aaaatgaaag 
atttgcagct tacgtttact ggtttcttcc 
aaaatgaaca aaattctgtt accctggaag 
gaaaggatgt aagttgtcca ataaggcaag 
ttcctgacct caatcaaaca aaacccggaa 
aatttgaacc tagtaacagc catatggtga 
gtccaggaag aagagagttt aatggaatga 
tcaatgaaga gcttccagcc agaagaaaac 
ttgcacaaat gacagtattt gataaaaaca 
aagtagccat gcaggaaatg gaagaatgtc 
ctattcttga tgggaagagg ctgcctccat 
agttcactct tcgttggaca ggagagacca 
ctcttgccac tagaaattca gagagtctcc 
ctactcaaac tattgctgtt aaagaatcat 
aggatactcc aaatgaaaac cgacaaaaat 
acaatacaag gcaacaaact gaagcaagag 
actgccgcaa actttatagt ttactcaagc 
tcaactatgt ttatcatcca aaaggtgcta 
atggctccta tgcaggaaat cctcaggata 
gcaacggacc agttaagaga acacctatca 
caaaagcaag catgtctgaa tttcttgaat 
catatagtag tggccacaat cgtctgtatt 
cacaagaaat ggaagtagat agtgaagatg 
ccattacaca aattgaagag ttttctgatg 
tctggaatct ccatgtcatg aagcatgggt 
gtatgctgtt tgtagaaaat tatggacaga 
tcatgcttca tctagtcagc atgcatgact 
aagctgttac caagctccgt gaaatgcagc 
ctgcaaacga agaaataact gaagaacaaa 
actcaaaaga gaaagctttg gaaacagata 
aacaaaaact ctgaaaagct ctaaccccat 
gggaattcat cctctaagaa ttatgttttt 
gttagatgaa gtaaatgatt tcaacaagga 
tatgcagcat tacatgtata tcacttttat 
catgaaaagc aatatttcaa agtattttta 
ttgatctaga aattatttca tatataaatc 
tttctacttt gtaattgtga gacattttct 
tttagaaatt gaagtggtct tcatatgtca 
aggattttta tgaaattata ttgcattact 
gaaatcattt gtcaattcgg aatgaaaaat 
ttttacaatt aaaaaatagc -acttcttcat 
tcacattgtt gacagtgaaa tgctatgttg 
tgtggataat tttagtgcat tgctcacccg 
tgcttgtttt gtttttcttt tttaattaga 
ctttaaatta ggattgcaaa ccaaggaaag 
taaggggaga agtgttctta aaaagtcaac 
caaggatgtc tttgtaatgt gtttcatgaa 
aatcattttg agcaattttg ccctgtgtta 
attttctcca acagaaagtg gattcactac 
attccaactc cgagcactgt ggttgagtaa 



acggcggtgg gggagggggc ggctcggggc 240 
ggggttcggc ggcggtggcg gcggcgacgg 300 
gtggaggggg tggcagttac tcggcctcct 360 
ctgcggtgtt accggtgaag aagccgaaaa 420 
tcctccaggc ctttgagaag ccaacacaga 480 
tagcaccaat atttttgcac agaactctta 540 
acatcaaaag gaaaacattt aaagttgatg 600 
gagagcaaga atctcatagc ttgtcagctc 660 
acaaaaatga taagccatca ccaaactcag 720 
tcctgcttgt gaaagtttgc cacaaaaaaa 780 
ttcccacagg taaaaagcag gtgcctttga 840 
atttcccgtc ccttgcagtt tccagtaatg 900 
agtcttactc gttgctattt agagtgactc 960 
ttaatggaga aaccaatgaa aatattgatg 1020 
gaaatcgtga ggatggggaa aagacatttg 1080 
ggcgcttaca gcttttagat ggggaatatg 1140 
caataagcaa gaaaagagca acatgggaga 1200 
tcgaaacatt ttctcaggga cctacgttgc 1260 
atgataaatc tacggctcct attgccaaac 1320 
atcaggaaaa caagcctggt tcagttaaac 1380 
tgactacaga tctacaaaca agaaaagaaa 1440 
taagaatatt ttatcagttt ctctataaca 1500 
atgacctgca ttgcccttgg tgtactctga 1560 
atcttaaact ctgccatagc agatttatct 1620 
ggatagatgt ttctatcaat gagtgttatg 1680 
ttcatcgcca acctggattt gcttttagtc 1740 
cacatattct tgtgtgcagg ccaaaacgaa 1800 
ctgaagatgg ggaagtagaa cagcaaagaa 1860 
tccatagtga tacctgctta cctctccgtc 1920 
aaaaggatcc tgaatggcta agagaaaaaa 1980 
ttaatgaagg agagaaagaa gtgatgaaac 2040 
ttattgctga caatcaaatg aatcatgcct 2100 
aaataattaa gaagaattta tgtcgaaact 2160 
ttaatcttat tagcataatg tcaatagata 2220 
aaaaattaga aaagggggaa tctgcttccc 2280 
atgggacagc aaatggattt agtgaaatta 2340 
gtgtctcagg ggtttcaaaa cagagcaaaa 2400 
gttatggaca aacactgaaa ttacatttta 2460 
gtttttaatc atatgttcca aacaggcact 2520 
tatttgtatc agggttctac ttcacttcat 2580 
tgatgtcatt aaaacattct gtactttaag 2640 
aactcaacaa atgtcatcaa atatgttgaa 2700 
agaatttttt tgcatttatg aacggctgtt 2760 
tggggaggga aaattggaat ggttcccttt 2820 
actacagaaa aggaaaaaaa tagaaattga 2880 
atttgcagtc aaactttgat ccttgttttt 2940 
tataatgtaa ttttacatta cataagttcc 3000 
cttatgcctg tttgagaaga tattaaattt 3060 
gtttataaga ttacagacca tttgttttca 3120 
gtatgttttt tttttttaac ttgaacattt 3180 
taatcacacg gaaaattaag ctgttcatat 3240 
aacgcatttg agattttaag atgtcactta 3300 
cagaaaactg ttatgccttt tatttgtttg 3360 
tagaatatcc aatagagata agctgacttg 3420 
tatgtgtttc acgcacatat ttgcagttgg 3480 
tggcacatta acaagcacca ataggttttt 3540 
catcacctca attttttatt atccttaaag 3600 



15 



atattgcatt ttcatattct ttatttataa 
tttaatttta aaatttcatt ccaccaccat 
gggatatata agctttctaa tggtgtcttc 
actggtcttt aagatgtgtt taactgtgag 
gtcatccagt attaagttct tagtcattga 
ggaaactgca gctttcatta cagattcctt 
agtaaactct gatttttgcc tctggatagt 
ttgctaaagc tgtgcacata tgtaaaaaaa 
aggtgtagaa ttattgctta tgtcatttct 
aggctagcat tgtttgcaca aaaagttggt 
ttctgttgag taaacttttt atgtcatcgt 
tataaaaaaa gtgcttttct atatgtaccc 
gatgataaag catttgaatg gtacagtaga 
tgtttttaca ttaaatgttt atttgaaatc 
t 



aggatcaatg ctgctgtaaa tacaggtatt 3660 
cagatgcagt tccctatttt gtttaatgaa 3720 
agaaatttat aaaatgtaaa tactgatttg 3780 
gctatttaac gaatagtgtg gatgtgattt 3840 
tttttgtgtt taaaaaaaaa taggaaagag 3900 
gattggtaag ctctccaaat gatgagttct 3960 
agatctcgag cgtttatctc gggctttaat 4020 
aaaaaaaaaa gattatttta ggggagatgt 4080 
taagcagtta tgctcttaat gcttaaaaga 4140 
gattcccacc ccaaatagta ataaaattac 4200 
aaaagctgga aaaatccctt tgtttctatt 4260 
ttgataacag attttgaaga aatcctgtaa 4320 
tgtaaaaaaa attcagttta aaagaacatt 4380 
aaatgatttt gtacataaag ttcaataata 4440 

4441 



<210> 16 
<211> 803 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Leu Arg Arg His Phe Phe Phe Pro Pro Ser Phe Pro Pro Leu Leu Leu 
15 10 15 

Pro Ser Leu Pro Leu Ser Ser Pro Leu Ser Ser Phe Pro Pro Arg Ser 
20 25 30 

Ala Gly Ala Cys Trp Gly Glu Arg Leu Val Leu Gin Ala Leu Ala Leu 
35 40 45 

Arg Gly Arg Pro Ala Gly Ser Trp Arg Gly Glu Glu Ala Gly Thr Ala 
50 55 60 

Met Ala Pro Gin Lys His Gly Gly Gly Gly Gly Gly Gly Ser Gly Pro 
65 70 75 80 

Ser Ala Gly Ser Gly Gly Gly Gly Phe Gly Gly Ser Ala Ala Val Ala 
85 90 95 

Ala Ala Thr Ala Ser Gly Gly Lys Ser Gly Gly Gly Ser Cys Gly Gly 
100 105 110 

Gly Gly Ser Tyr Ser Ala Ser Ser Ser Ser Ser Ala Ala Ala Ala Ala 
115 120 125 

Gly Ala Ala Val Leu Pro Val Lys Lys Pro Lys Met Glu His Val Gin 
130 135 140 

Ala Asp His Glu Leu Phe Leu Gin Ala Phe Glu Lys Pro Thr Gin lie 
145 150 155 160 



Tyr Arg Phe Leu Arg Thr Arg Asn Leu lie Ala Pro lie Phe Leu His 
165 170 175 
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Arg Thr Leu Thr Tyr Met Ser His Arg Asn Ser Arg Thr Asn lie Lys 
180 185 190 

Arg Lys Thr Phe Lys Val Asp Asp Met Leu Ser Lys Val Glu Lys Met 
195 200 205 

Lys Gly Glu Gin Glu Ser His Ser Leu Ser Ala His Leu Gin Leu Thr 
210 215 220 

Phe Thr Gly Phe Phe His Lys Asn Asp Lys Pro Ser Pro Asn Ser Glu 
225 230 235 240 

Asn Glu Gin Asn Ser Val Thr Leu Glu Val Leu Leu Val Lys Val Cys 
245 250 255 

His Lys Lys Arg Lys Asp Val Ser Cys Pro lie Arg Gin Val Pro Thr 
260 265 270 

Gly Lys Lys Gin Val Pro Leu lie Pro Asp Leu Asn Gin Thr Lys Pro 
275 280 285 

Gly Asn Phe Pro Ser Leu Ala Val Ser Ser Asn Glu Phe Glu Pro Ser 
290 295 300 

Asn Ser His Met Val Lys Ser Tyr Ser Leu Leu Phe Arg Val Thr Arg 
305 310 315 320 

Pro Gly Arg Arg Glu Phe Asn Gly Met lie Asn Gly Glu Thr Asn Glu 
325 330 335 

Asn lie Asp Val Asn Glu Glu Leu Pro Ala Arg Arg Lys Arg Asn Arg 
340 345 350 

Glu Asp Gly Glu Lys Thr Phe Val Ala Gin Met Thr Val Phe Asp Lys 
355 360 365 

Asn Arg Arg Leu Gin Leu Leu Asp Gly Glu Tyr Glu Val Ala Met Gin 
370 375 380 

Glu Met Glu Glu Cys Pro lie Ser Lys Lys Arg Ala Thr Trp Glu Thr 
385 390 395 400 

He Leu Asp Gly Lys Arg Leu Pro Pro Phe Glu Thr Phe Ser Gin Gly 
405 410 415 

Pro Thr Leu Gin Phe Thr Leu Arg Trp Thr Gly Glu Thr Asn Asp Lys 
420 425 430 

Ser Thr Ala Pro He Ala Lys Pro Leu Ala Thr Arg Asn Ser Glu Ser 
435 440 445 

Leu His Gin Glu Asn Lys Pro Gly Ser Val Lys Pro Thr Gin Thr He 
450 455 460 



Ala Val Lys Glu Ser Leu Thr Thr Asp Leu Gin Thr Arg Lys Glu Lys 
465 470 475 480 
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Asp Thr Pro Asn Glu Asn Arg Gin Lys Leu Arg He Phe Tyr Gin Phe 
485 490 495 

Leu Tyr Asn Asn Asn Thr Arg Gin Gin Thr Glu Ala Arg Asp Asp Leu 
500 505 510 

His Cys Pro Trp Cys Thr Leu Asn Cys Arg Lys Leu Tyr Ser Leu Leu 
515 520 525 

Lys His Leu Lys Leu Cys His Ser Arg Phe He Phe Asn Tyr Val Tyr 
530 535 540 

His Pro Lys Gly Ala Arg He Asp Val Ser He Asn Glu Cys Tyr Asp 
545 550 555 560 

Gly Ser Tyr Ala Gly Asn Pro Gin Asp He His Arg Gin Pro Gly Phe 
565 570 575 

Ala Phe Ser Arg Asn Gly Pro Val Lys Arg Thr Pro He Thr His He 
580 585 590 

Leu Val Cys Arg Pro Lys Arg Thr Lys Ala Ser Met Ser Glu Phe Leu 
595 600 605 

Glu Ser Glu Asp Gly Glu Val Glu Gin Gin Arg Thr Tyr Ser Ser Gly 
610 615 620 

His Asn Arg Leu Tyr Phe His Ser Asp Thr Cys Leu Pro Leu Arg Pro 
625 630 635 640 

Gin Glu Met Glu Val Asp Ser Glu Asp Glu Lys Asp Pro Glu Trp Leu 
645 650 655 

Arg Glu Lys Thr He Thr Gin He Glu Glu Phe Ser Asp Val Asn Glu 
660 665 670 

Gly Glu Lys Glu Val Met Lys Leu Trp Asn Leu His Val Met Lys His 
675 680 685 

Gly Phe He Ala Asp Asn Gin Met Asn His Ala Cys Met Leu Phe Val 
690 695 700 

Glu Asn Tyr Gly Gin Lys He He Lys Lys Asn Leu Cys Arg Asn Phe 
705 710 715 720 

Met Leu His Leu Val Ser Met His Asp Phe Asn Leu He Ser He Met 
725 730 735 

Ser He Asp Lys Ala Val Thr Lys Leu Arg Glu Met Gin Gin Lys Leu 
740 745 750 

Glu Lys Gly Glu Ser Ala Ser Pro Ala Asn Glu Glu He Thr Glu Glu 
755 760 765 

Gin Asn Gly Thr Ala Asn Gly Phe Ser Glu He Asn Ser Lys Glu Lys 
770 775 780 
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Ala Leu Glu Thr Asp Ser Val Ser 
785 790 

Gin Lys Leu 



Gly Val Ser Lys Gin Ser Lys Lys 
795 800 



<210> 17 
<211> 22 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 17 

Cys Pro Phe Cys Ser Met Leu Cys Gly Ser Phe Lys Gly Leu Gin Phe 
15 10 15 

His Leu Asn Ser Ser His 
20 



<210> 18 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Cys Pro Phe Cys Ala Glu Ser Tyr Asp lie lie Gly Leu Cys Cys His 
15 10 15 

lie Asp Asp Glu His 
20 



<210> 19 
<211> 20 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 19 

Cys Pro Val Cys Ser Leu Lys Val Gly Val Asp lie Val Ala His lie 
15 10 15 

Thr Leu His His 
20 
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<210> 20 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 

Cys Ser Phe Cys Lys Arg Glu Phe Arg Ser Ala Gin Ala Leu Gly Gly 
15 10 15 

His Met Asn Val His 
20 



<210> 21 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 21 

Cys Pro Phe Cys Ser Asp Tyr Phe Asp He Val Ser Leu Cys Cys His 
15 10 15 

He Asp Glu Asp His 
20 



<210> 22 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 



He Asp Glu Glu His 
20 



<210> 23 
<211> 22 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 23 

Cys Pro lie Cys Leu Arg Lys Phe Asp Asn Leu Gin Ala Leu Asn Ala 
15 10 15 

His Leu Asp Val Glu His 
20 



<400> 22 
Cys Pro Phe 
1 



Cys Ser Asp Asp Tyr Asp Leu Val Glu Leu Cys His His 




20 



<210> 24 
<211> 21 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 24 

Cys Pro He Cys Ser Lys Pro Cys 
1 5 

His Met He He His 
20 



Val Gly Glu Asn Gly Leu Gin Met 
10 15 



<210> 25 
<211> 22 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 25 

Cys Pro Tyr Cys Glu He Lys Cys Lys Arg Lys Asp Leu Leu Lys Arg 
15 10 15 

His He Gin Arg Phe His 
20 



<210> 26 
<211> 22 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 26 

Cys Asp Val Cys Ala Phe Lys Cys Ser Ser Tyr Gin Thr Leu Glu Ale 
1 5 10 15 

His Leu Thr Ser Asn His 
20 



<210> 27 
<211> 22 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 27 

Cys Pro Val Cys Glu Leu Val He 
1 5 

His Met Asn Gin Lys His 
20 



Pro Thr Glu Lys Gly Leu Lys Asn 
10 15 
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<210> 28 
<211> 22 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 28 

Cys Pro lie Cys Lys Cys Glu Cys Ser Gly Arg Glu Asp Cys Gin Leu 
1 5 10 15 

His Met Tyr Ala Ser His 
20 



<210> 29 
<211> 22 
<212> PRT 

<213> Drosophila melanogaster 
<400> 29 

Cys Pro Tyr Cys Arg Arg Thr Phe Ser Cys Tyr Tyr Ser Leu Lys Arg 
1 5 10 15 

His Phe Gin Asp Lys His 
20 



<210> 30 
<211> 22 
<212> PRT 

<213> Drosophila melanogaster 
<400> 30 

Cys Pro lie Cys Tyr Ala Val lie Arg Gin Ser Arg Asn Leu Arg Arg 
15 10 15 

His Leu Glu Leu Arg His 
20 



<210> 31 
<211> 22 
<212> PRT 

<213> Drosophila melanogaster 
<400> 31 

Cys Cys Phe Cys Ser Met Cys Phe Glu Ser Val Gin Glu Leu Val Arg 
15 10 15 

His Leu Ser Gly His His 
20 
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<210> 32 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Cys Pro Phe Cys Arg Ala Leu Phe Lys Ala Lys Thr Ala Leu Glu Ala 
15 10 15 

His lie Arg Ser Arg His 
20 



<210> 33 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 33 

Cys Pro Trp Cys Thr Leu Asn Cys Arg Lys Leu Tyr Ser Leu Leu Lys 
15 10 15 

His Leu Lys Leu Cys His 
20 



<210> 34 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Cys Glu Val Cys Ala Phe Ala Cys Lys Arg Lys Tyr Glu Leu Gin Lys 
15 10 15 

His Met Ala Ser Gin His 
20 



<210> 35 

<211> 22 

<212> PRT 

<213> Mus musculus 

<400> 35 

Cys Pro Tyr Cys Pro Pro Asn Gly Arg Val Arg Gly Asp Leu Val Glu 
15 10 15 

His Leu Arg Gin Ala His 
20 
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<210> 36 
<211> 21 
<212> PRT 

<213> Mus musculus 
<400> 36 

Cys Arg Phe Cys Ala Lys Val Phe Gly Ser Asp Ser Ala Leu Gin lie 
15 10 15 

His Leu Arg Ser His 
20 



<210> 37 
<211> 22 
<212> PRT 

<213> Rattus norvegicus 
<400> 37 

Cys Asn Tyr Cys Pro Glu Met Phe Ala Asp lie Asn Ser Leu Gin Glu 
15 10 15 

His lie Arg Val Ser His 
20 



<210> 38 
<211> 22 
<212> PRT 

<213> Xiphophorus maculatus 
<400> 38 

Cys Pro His Cys Glu Phe Arg Cys Ala Asp Gin Ser Asn Leu Lys Thr 
15 10 15 

His lie Lys Ser Lys His 
20 



<210> 39 
<211> 39 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (1) . . (39) 

<400> 39 

gaa aag caa ttc atg cat ctt tgg aac teg ttt gta aga 39 
Glu Lys Gin Phe Met His Leu Trp Asn Ser Phe Val Arg 
15 10 
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<210> 40 
<211> 13 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 

Glu Lys Gin Phe Met His Leu Trp Asn Ser Phe Val Arg 
15 10 



<210> 41 
<211> 39 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (1)..(24) 

<400> 41 

gaa aag caa ttc atg cat ctt tga aactcgtttg taaga 39 
Glu Lys Gin Phe Met His Leu 
1 5 



<210> 42 
<211> 7 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Glu Lys Gin Phe Met His Leu 
1 5 



<210> 43 
<211> 33 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (I),. (33) 

<400> 43 

tgt ggt age ttc aag ggg ctg caa ttt cat ttg 33 
Cys Gly Ser Phe Lys Gly Leu Gin Phe His Leu 
15 10 



<210> 44 

<211> 11 

<212> PRT 

<213> Arabidopsis 



thaliana 
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<400> 44 

Cys Gly Ser Phe Lys Gly Leu Gin Phe His Leu 
15 10 



<210> 45 
<211> 54 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 
<222> (1)*.(36) 

<400> 45 

tgt ggt age ttc aag gtg ggc aac tat tac aac tga ggggctgcaa 
Cys Gly Ser Phe Lys Val Gly Asn Tyr Tyr Asn 
15 10 

tttcattt 



<210> 46 
<211> 11 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 46 

Cys Gly Ser Phe Lys Val Gly Asn Tyr Tyr Asn 
15 10 



<210> 47 
<211> 1722 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 47 

caagcttctt caattttget tgctctctct tacacagcca atcggtgttt tegcagcttt 60 
caggcctcaa tccaagacat tctatataag catattgeag aagaggcggt tctaattgtt 120 
gcattgagtt tategctatg aegtagggaa attctaattt aggggaggee tcagagtttg 180 
cactaacttc ataategget cttgacgttg ttgagtgtaa ttgaacaaga atgtgtaggc 240 
agaattgtcg cgcgaaatcc teaceggagg aagtgatttc aactgatgag aatctcttga 300 
tatattgtaa acctgttcga ctatataaca tctttcacct tcgctctcta ggcaacccat 360 
cgtttcttcc aagatgcttg aactacaaaa ttggagcaaa gegcaaaaga aagtcaagat 420 
ctactgggat ggtagttttc aactataagg attgtaataa cacattacag aaaactgaag 480 
ttagggagga ttgttcttgt ecattttget etatgetatg tggtagcttc aaggggctgc 540 
aatttcattt gaattcatct catgatttat ttgaatttga gttcaagctt ttcgaagaat 600 
accagacagt taatgtttct gtaaaactta attccttcat atttgaggaa gaaggaagtg 660 
atgacgataa atttgagece ttctctctct gctcgaaacc tegtaagegg agacaaagag 720 
gtggcagaaa taacaccagg agacttaaag tatgettttt accgttggat tcacccagtt 780 
taactaatgg cacagaaaat ggaatcaccc tacttaatga tggaaaccgt ggtttaggat 840 
atcccgaggc aacagagctt gctggacaat ttgagatgac cagcaacatt ccaccagcca 900 
tagcccactc ttctctggac gctggtgcta aagttatatt gaeaagegaa gctgtggtcc 960 
ctgctactaa gacaagaaag ttatctgctg agegatcaga ggctagaagc cacctacttc 1020 



26 



ttcagaaacg 
tgtctgaccg 
agatgcttga 
actcgtttgt 
cattttcaag 
gattgttttt 
gcaataccat 
acagtgtgga 
atgacataaa 
ttaaatagga 
tttcttaaaa 
tacattttta 



ccaattctat 
ggatagcgag 
tgactttgtg 
aagaaaacaa 
attttacgag 
gattaaacta 
cctcgagaat 
tcgtcccagt 
caacaagaac 
aaatctccgg 
aattgtttag 
gttaaaaaaa 



cattctcaca 
gatgaagtcg 
gatgtgaata 
agggttatag 
aaagagttgc 
tggaaccatg 
tgccgtaata 
gactcaaaca 
aatgttgaca 
cttttatgat 
gagcaaacaa 
aaaatggatt 



gagtccagcc 
atgacgatgt 
aagatgaaaa 
cagatggtca 
accgttactc 
gacttgtcga 
gctcagacac 
ccaacaacaa 
acaaggacaa 
accgatttat 
attttttata 
ctgcttataa 



aatggcgctt 
tgcagatttt 
gcaattcatg 
tatctcttgg 
atcactcttc 
ctcagccacc 
caccaccacc 
taacattgtg 
taacagcaga 
cggattgtaa 
tgttagtgta 
ct 



gagcaagtaa 
gaagatcgcc 
catctttgaa 
gcatgtgaag 
tggtgttgga 
atcaacaact 
aacaacaaca 
gatcatccca 
gacaaagtaa 
cttattcttc 
ttcaactgat 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1722 



<210> 48 
<211> 40 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 48 

gaaaagcaat tcatgcatct ttggaactcg tttgtaagaa 40 



<210> 49 
<211> 40 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 49 

gaaaagcaat tcatgcatct ttgaaactcg tttgtaagaa 40 



<210> 50 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 

<222> (4) • • (7) 

<223> n=a or g or c or t 

<220> 

<223> Description of Artificial Sequence: Xmnl site 

<400> 50 
cttnnnnaag 



<210> 51 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 51 

gagaagtagt tacctttgtt ttcttacaga agagt 35 



<210> 52 

<211> 40 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: PCR product 
<400> 52 

gaaaagcaat tcatgcatct ttggaactct tctgtaagaa 40 



<210> 53 
<211> 40 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR product 
<400> 53 

gaaaagcaat tcatggatct ttgaaactct tctgtaagaa 40 



<210> 54 
<211> 367 
<212> PRT 

<213> Homo sapiens 
<400> 54 

lie Ala Lys Pro Leu Ala Thr Arg Asn Ser Glu Ser Leu His Gin Glu 
15 10 15 

Asn Lys Pro Gly Ser Val Lys Pro Thr Gin Thr lie Ala Val Lys Glu 
20 25 30 

Ser Leu Thr Thr Asp Leu Gin Thr Arg Lys Glu Lys Asp Thr Pro Asn 
35 40 45 

Glu Asn Arg Gin Lys Leu Arg lie Phe Tyr Gin Phe Leu Tyr Asn Asn 
50 55 60 

Asn Thr Arg Gin Gin Thr Glu Ala Arg Asp Asp Leu His Cys Pro Trp 
65 70 75 80 

Cys Thr Leu Asn Cys Arg Lys Leu Tyr Ser Leu Leu Lys His Leu Lys 
85 90 95 

Leu Cys His Ser Arg Phe lie Phe Asn Tyr Val Tyr His Pro Lys Gly 
100 105 110 



Ala Arg lie Asp Val Ser lie Asn Glu Cys Tyr Asp Gly Ser Tyr Ala 
115 120 125 
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Gly Asn Pro Gin Asp lie His Arg Gin Pro Gly Phe Ala Phe Ser Arg 
130 135 140 

Asn Gly Pro Val Lys Arg Thr Pro lie Thr His lie Leu Val Cys Arg 
145 150 155 160 

Pro Lys Arg Thr Lys Ala Ser Met Ser Glu Phe Leu Glu Ser Glu Asp 
165 170 175 



Gly Glu Val Glu Gin Gin Arg Thr Tyr Ser Ser Gly His Asn Arg Leu 
180 185 190 

Tyr Phe His Ser Asp Thr Cys Leu Pro Leu Arg Pro Gin Glu Met Glu 
195 200 205 

Val Asp Ser Glu Asp Glu Lys Asp Pro Glu Trp Leu Arg Glu Lys Thr 
210 215 220 

lie Thr Gin lie Glu Glu Phe Ser Asp Val Asn Glu Gly Glu Lys Glu 
225 230 235 240 

Val Met Lys Leu Trp Asn Leu His Val Met Lys His Gly Phe lie Ala 
245 250 255 

Asp Asn Gin Met Asn His Ala Cys Met Leu Phe Val Glu Asn Tyr Gly 
260 265 270 

Gin Lys lie lie Lys Lys Asn Leu Cys Arg Asn Phe Met Leu His Leu 
275 280 285 

Val Ser Met His Asp Phe Asn Leu lie Ser lie Met Ser lie Asp Lys 
290 295 300 

Ala Val Thr Lys Leu Arg Glu Met Gin Gin Lys Leu Glu Lys Gly Glu 
305 310 315 320 

Ser Ala Ser Pro Ala Asn Glu Glu He Thr Glu Glu Gin Asn Gly Thr 
325 330 335 

Ala Asn Gly Phe Ser Glu lie Asn Ser Lys Glu Lys Ala Leu Glu Thr 
340 345 350 

Asp Ser Val Ser Gly Val Ser Lys Gin Ser Lys Lys Gin Lys Leu 
355 360 365 
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<210> 55 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Motif 
<400> 55 

Gin Ala Leu Gly Gly 
1 5 



<210> 56 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<210> 57 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 57 

ttgcttgcct gaaaaaagta tg 22 

<210> 58 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<220> 
<223> 



Description of Artificial Sequence: Primer 



<400> 56 

actgttcgtc tccttcatca tg 



22 



<400> 58 

tgtcgatatg cgaccagtac c 



21 



<210> 59 
<211> 21 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 59 

caggcttaga cccaattgac c 

<210> 60 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 60 

aggtaggatc cgacatcgtc ttcttattta ccg 

<210> 61 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 61 

ctcttgaatt caaaactatt cctactctca cac 



<210> 62 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 62 

gccaatcggt gttttcgcag ctttc 



<210> 63 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 63 

aagaataagt tacaatccga taaatcgg 
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<210> 64 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 64 

cagtggttga agcttaagga gg 

<210> 65 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 65 

gcaatgaata aatcataatc ttgg 



<210> 66 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 66 

tctactggga tggtagtttt c 



<210> 67 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 67 

atatcccgag gcaacagagc ttg 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



Sequence: Primer 

22 

Sequence : Primer 

24 

Sequence: Primer 

21 

Sequence: Primer 

23 

Sequence: Primer 
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<400> 68 

catctttgga actcgtttg 



<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 69 

ctcagttgta atagttgccc 

<210> 70 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 70 

aagagtgggc tatggctgg 



<210> 71 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 71 

gcaactcttt ctcgtaaaat cttg 



<210> 72 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 72 

gcctccataa ctgtcatcac ate 



<210> 73 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 73 

tttcattggt catgggatgg 



<210> 74 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 74 

gacttcagag atgggtttat gc 



<210> 75 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 75 

tccatatcta gctccttcgc c 



<210> 76 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 76 

tgcgttcatt aagtaggcaa cagaaaatgg 



<210> 77 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 77 

gagaagtagt tacctttgtt ttcttacaga agagt 



